Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

RESULT 1
AF125103
LOCUS       AF125103                1798 bp    mRNA    linear   HTC 22-MAY-2001
DEFINITION  Homo sapiens neuroendocrine specific protein c homolog mRNA,
            complete cds.
ACCESSION   AF125103
VERSION     AF125103.1  GI:5107001
KEYWORDS    HTC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1798)
  AUTHORS   Zhang,Q.H., Ye,M., Wu,X.Y., Ren,S.X., Zhao,M., Zhao,C.J., Fu,G.,
            Shen,Y., Fan,H.Y., Lu,G., Zhong,M., Xu,X.R., Han,Z.G., Zhang,J.W.,
            Tao,J., Huang,Q.H., Zhou,J., Hu,G.X., Gu,J., Chen,S.J. and Chen,Z.
  TITLE     Cloning and functional analysis of cDNAs with open reading frames
            for 300 previously undefined genes expressed in CD34+ hematopoietic
            stem/progenitor cells
  JOURNAL   Genome Res. 10 (10), 1546-1560 (2000)
  MEDLINE   20499367
   PUBMED   11042152
REFERENCE   2  (bases 1 to 1798)
  AUTHORS   Ye,M., Zhang,Q., Zhou,J., Shen,Y., Guan,Z., Wu,X., Fan,H., Mao,H.,
            Dai,M., Huang,Q., Chen,S. and Chen,Z.
  TITLE     Human neuroendocrine specific protein c homolog mRNA, complete cds
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 1798)
  AUTHORS   Ye,M., Zhang,Q., Zhou,J., Shen,Y., Guan,Z., Wu,X., Fan,H., Mao,H.,
            Dai,M., Huang,Q., Chen,S. and Chen,Z.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-FEB-1999) Shanghai Institute of Hematology, Shanghai
            Second Medical University, Rui-Jin Hospital, 197 Rui-Jin Road II,
            Shanghai 200025, P. R. China
FEATURES             Location/Qualifiers
     source          1..1798
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /cell_type="CD34+ hematopoietic stem/progenitor cells"
     CDS             215..814
                     /codon_start=1
                     /product="neuroendocrine specific protein c homolog"
                     /protein_id="AAD39920.1"
                     /db_xref="GI:5107002"
                     /translation="MDGQKKNWKDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIV
                     SVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA
                     LGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLLQCSC
                     YLERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE"



ORIGIN 
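The CDS feature of this record spans 215..814 with /codon_start=1, which fixes how many residues the /translation qualifier should hold. The following is a minimal Python sketch of that arithmetic; the function name is hypothetical, and it assumes a 1-based inclusive span that ends in a stop codon.

```python
def expected_protein_length(start, end):
    """Residues encoded by a CDS spanning start..end (1-based, inclusive).

    Assumes codon_start=1 and that the final codon is a stop codon,
    which is not counted as a residue.
    """
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError("CDS span is not a whole number of codons")
    return span // 3 - 1


# CDS 215..814 from this record: 600 bases, 200 codons, 199 residues
print(expected_protein_length(215, 814))
```

A count that disagrees with the printed translation is a quick hint that characters were lost or merged when the record was transcribed.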



  Query Match             25.7%;  Score 986;  DB 11;  Length 1798;
  Best Local Similarity   86.1%;  Pred. No. 2.4e-169;
  Matches 1360;  Conservative    0;  Mismatches  140;  Indels   80;  Gaps   21;
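The Best Local Similarity figure appears to equal Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). This relation is inferred from the numbers printed in this report, not taken from the search program's documentation, and the function name is hypothetical. A minimal Python sketch:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts printed for RESULT 1 (AF125103), reported as 86.1%
print(best_local_similarity(1360, 0, 140, 80))
```

The same calculation reproduces the percentages printed for the other results in this listing, which supports the assumed definition without proving it.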

Qy       2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363
              |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db        248 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 307

Qy       2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423
               | ||||||||||| ||  |||||||||||||||||||||| ||||| ||||||||||||
Db        308 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 367

Qy       2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483
              |||||||||||||||||||| ||||||||||||||||| |||||||||||||| ||||||
Db        368 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 427

Qy       2484 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543
              |||||||| ||||||||||||||||||||||||||| |||||||||||||||||||||||
Db        428 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 487

Qy       2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603
              ||||||||||||||||||||||| |||||||||||||||||||| |||||||| |||||
Db        488 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 547

Qy       2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663
              |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db        548 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 607

Qy       2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723
              |||||||||||||||||||||||||||||||| ||||||||||||||||| |||||||||
Db        608 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 667

Qy       2724 CTGATTTCACT-CTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCA 2782
              || |||||||| ||||||||||||||||||||| |||||||||||||| || ||||||||
Db        668 CTCATTTCACTCCTTCAGTGTTCCTGTTATTTA-GAACGGCATCAGGCACAGATAGATCA 726

Qy       2783 TTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 2842
              |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        727 TTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 786

Qy       2843 CCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTT 2902
              ||||||||||||||| |||||||||||| || |||  ||| | |||       |||||||
Db        787 CCCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA-----GTAGGAGTT 841

Qy       2903 TATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCT 2962
               |||||||||||||||||||||||||||  |  ||||||||||||||||||||| || |||
Db        842 CATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA-CCT 900

Qy       2963 TGACATTGCAGTGCAGTTTCAC-----------AGATCTTTATTTTTAGCAACGCAGTG- 3010
              |||| |||||||||||||||||           ||||||||||||||||| | ||| ||
Db        901 TGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGT 960

Qy       3011 TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCT 3070
              | ||||||||||| |||||||||||||||| |||||||||||||||||||||||||||||
Db        961 TGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCT 1020

3071 GCTATGTATGGATTTAAATCGTAATCATATTTGTTT — TTCCTGTATGAGGCACTGGTGA 3128 
I I I I I I I I II M I I I I I I I II II I I I I I I I III I II I I I I I I I I I M I I I 



Db 1021 G CT AT GT AT GGAT T T AAACC GT AAT CAT AT C T T T T T C CT AT C TAT CT G AGGC ACT G GT G G 108 0 

Qy 312 9 AT AAACAAAGAT C T GAGAAAGCT GT AT AT T AC AC T T T GT C G C AGGT AGT C T T G C T GT AT - 318 7 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M 

Db 1081 AATAAAAAAC C T GT AT AT T T T AC T T T GT T GC AGAT AGT C T T G C C GC AT C 112 9 

Qy 3188 T T GG G GAAT T G C AAAGAAAGT GGAG C T G AC AG AAAT AAC C C T T T T C AC AGT T T G 3241 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 1130 T T GG CAAGT T G C AGAGAT GGT GGAG C T AGAAAAAAAAAAAAAAAAGC C CT T T T CAGT T T G 1189 

Qy 3242 T GCACT GT GTAC GGT CT GTGTAGGT T GAT GC AGAT T TT CT GAAAT GAAA TGTTTAG 3297 

I I I I I M I I I I I I I I 1 I I I I I I I I I I I I I M I I I ! I I I I I I I I I I I I I I I I I I 

Db 1190 T GCACT GT GT AT GGT C C GT GT AGAT T GAT G C AGAT T T T CT GAAAT GAAAT GT T T GT T TAG 1249 

Qy 32 98 AC GAGAT CAT GC C AC CAAGGCAG GAGT GAAAAAG CTTGCCTTTCCTG GT AT GT T CT AG GT 3357 

I I I I I I I I I I II I I I I I 1 II III I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1250 AC GAGAT CAT AC C G GT AAAGC AG GAAT GACAAAG CTTG-CTTTTCTG GT AT GT T CT AGGT 1308 

Qy 3358 GTATT GT GAAATT TACT GTT GTATTAATT GCCAAT ATAAGTAAATAT AGATTAT ATAT AT 3417 

I I I I I I I II I I I I I I I I I I I I II I I I I I M I I I I I I I I I M I I I I I I I I I I I 
Db 1309 GT AT T GT GAC T T T TACT GTT AT AT T AAT T GC CAAT ATAAGTAAATAT AGAT TATA 1363 

Qy 3418 CTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTG 347 7 

1 I I I I I I t I I I I II I I I I I I I I I I I I I I I I I I M I 1 I I I I I I I M I I I I I I I I 
Db 13 64 - TAT GT AT AGT GT T T C ACAAAGCT T AGAC CT T T AC CT T - C C AGC CAC C C C AC AGT G CT T G 1421 

Qy 3478 ATACT TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAG 3531 

I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1422 ATATTT CAGAGT CAGT CATT GGTTATACAT GT GT AGTT C CAAAGCACATAAGCTAGAAGA 1481 

Qy 3532 AAAC GTAC T T CT AGGC GCAC T AC CAT CT GTT T T C AAC AC GAAC CGAC G C CAT GCAAAC AG 3591 

I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I MMI I I I I I 
Db 14 82 AGAAAT AT T T CT AG GAGC AC T AC CAT C - GTT T T CAAC AT GAA AT GC CACACACATAG 1537 

Qy 3592 AAC T C C - T CAAC AT AAAC T T CAC T G CAC AG AC T T AC T GT AGT T AAT T T TAT CAC A — AAC 364 8 

I I I I I I I I I I I 1 II I I I I I I I I I I I I I I II I I I I I I I II I I M I I I I I I I 
Db 1538 AACT CCAACAACAT CAATTT CATT GCACAGACT GACT GTAGTTAATTTT GT CACAGGAT C 1597 

Qy 3649 TCTG GACT GAAT CT AAT GCTTCCAAAAA T GT T T GC AAAT AT CAAAC AT T GT T A 3701 

I I I M I I I I I I I I I I I I i I I I I I I II I I I I I I II I I I I I I II I I I I 

Db 1598 TAT GGACT GAAT CTAAT G CT T C CAAAAAT GT T GT T T GT T TGCAAAT AT CAAAC AT T GT T A 1657 

Qy 37 02 T GT AAGAAAAT AT AAAT GAC GAT T T AT ACAATT GT GGT T T AAG CT GT AT T G A 3753 

|| I I I M I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN 

Db 1658 T GCAAGAAATTATTAATTACAAAATGAAGATTTATACCATT GTGGTTTAAGCT GTACT GA 1717 

Qy 3754 AC T AAAT CT GT GGAAT G CAT T GT GAAC T GT AAAAG CAAAGT AT CAAT AAAGC T TAT AG AC 3813 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II 
Db 1718 ACT AAAT C T GT G GAAGG C AT T GT AAAC T GTAAAAG CAAAGT AT CAAT AAAG C T TAT AGAC 1777 

Qy 3814 TTAAAAAAAAAAAAAAAAAA 3833 

I I I I I I I I I I I I M I I 
Db 1778 C C AAAAC G AAAAAAAAAAAA 1797 



RESULT 2 
AF077050
LOCUS       AF077050                1785 bp    mRNA    linear   HTC 21-NOV-2002
DEFINITION  Homo sapiens neuroendocrine-specific protein C homolog mRNA,
            complete cds.
ACCESSION   AF077050
VERSION     AF077050.1  GI:4689147
KEYWORDS    HTC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1785)
  AUTHORS   Song,H., Peng,Y., Zhou,J., Huang,Q., Dai,M., Mao,Y., Yu,Y., Xu,X.,
            Luo,B., Hu,R. and Chen,J.
  TITLE     Human neuroendocrine-specific protein C (NSP) homolog gene
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1785)
  AUTHORS   Song,H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-JUL-1998) Rui-Jin Hospital, Shanghai Institute of
            Endocrinology, Molecular Medical Center, 197 Rui-Jin Road II,
            Shanghai 200025, P.R. China
FEATURES             Location/Qualifiers
     source          1..1785
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /tissue_type="pituitary"
     CDS             214..813
                     /codon_start=1
                     /product="neuroendocrine-specific protein C homolog"
                     /protein_id="AAD27783.1"
                     /db_xref="GI:4689148"
                     /translation="MDGQKKNWKDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIV
                     SVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA
                     LGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPV
                     IYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE"



ORIGIN 



  Query Match             23.9%;  Score 915.6;  DB 11;  Length 1785;
  Best Local Similarity   85.5%;  Pred. No. 1.5e-156;
  Matches 1238;  Conservative    0;  Mismatches  144;  Indels   66;  Gaps   17;



Qy 


2304 


Db 


247 


Qy 


2364 


Db 


307 


Qy 


2424 


Db 


367 


Qy 


2484 


Db 


427 



GT T GT T GAC CT C CT C T ACT G GAGAGAC ATT AAGAAGACT G GAGT GGTGTTTGGTGC C AG C 

| | | | I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I M I I 

GT T GT T GAC CT C CT GT ACT GGAGAGAC ATT AAGAAGACT GGAGT GGT GT T T G GT GC C AG C 

TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 

| M I I I I I I I I I M I I I I 1 I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 

TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 

| I 1 I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 

CAGAAAT CT GAT GAAGGC C AC C CAT T CAG GG C AT AT T T GGAAT C T GAAGT T GCT AT AT C T 

I I I I I I I I I I I I I II I I I I I I M II I II I I M I I I I I I 1 I I I I I I I I I I I I I I I N I I 

CAGAAAT C AGAT GAAG GC C AC C CAT T C AG GG C AT AT CT GGAAT C T GAAGT T G CT AT AT CT 



2363 



306 



2423 



366 



2483 



426 



2543 



486 



Qy 


O C A A 


DO 


A Q "1 


Qy 


o a r\ a 




^ A 1 


Qy 


2664 


Db 


bu / 


Qy 


2724 


Db 


can 
bo / 


Qy 


2784 


Db 


TOT 


Qy 


2844 


Db 


TOT 

78 / 


Qy 


2904 


Db 


o 


Qy 


2964 


Db 


y Ul 


Qy 


3013 


Db 


OCT 


Qy 


3072 


Db 




Qy 


3132 


T\"U. 

Db 


lU/f 


Qy 


j iyx 


UD 




Qy 


3244 


Db 


1188 


Qy 


3300 


Db 


1248 



GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2 603 

I I I I I I M II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M II I I I II I I I I I I 
GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 546

GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2 663 

I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I 
GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 606 

ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

| | | | | | I I I I || I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 666 

CT GAT T T CACT C TT C AGT GT T C CT GT TAT T TAT GAAC GGC AT C AG GC G CAAAT AGAT CAT 2783 

II I I I I I I I I I I I II I I M II I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I II I 

C T CAT T T CACT CTT C AGT GT T C CT GT T AT T TAT GAAC G G CAT C AG GC AC AGAT AGAT CAT 726 
TAT CT G GGACT T GCAAATAAGAAT GT T AAAGATG CT AT GGC T AAAAT C CAAG CAAAAAT C 2 843 

Mill I I I M I I I I I I I I II I I I I I I I I I M I I I II M I I I II II I I M I I I I I II I I I 

TAT CT AGGACT T GCAAATAAGAAT GT T AAAGAT GCT AT GGC T AAAAT C CAAGCAAAAAT C 7 86 
CCT GGATT GAAGCGTAAAGCT GAAT GAGAAAGCCT GAAAGAGTTAACAAT AGAGGAGTTT 2903 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M 

C CT G GAT T GAAGC GCAAAGCT GAAT GAAAAC GC C CAAAAT AAT T A GTAGGAGTTC 841 

AT CTT TAAAGG G GAT AT T CAT T T GAT T CCAT T GGG GAGGGT C AGG GAAGAACAAAGCCT T 2963 

I I I I I I I I I I I I I I I I I I I I I I II I I i I I I I I I I I II I I I I II I I I M II I I I I 

AT CTT T AAAG GGGAT AT T CAT T T GAT T AT AC GGG GGAGGGT C AGG GAAGAAC GAA- C CT T 900 
GAC AT T GCAGT G C AGT T T C AC AGAT CTT TAT T T T T AGCAAC GC AGT GT C 3012 

I M II I I M I I I I I I I I I I I I I I I I I I I I I Ml I M 

GAC GTT GCAGT GCAGT T T C AC AGAT C GT T GT T AGAT CTT TAT T T T T AGC CAT G CAC T GT T 960 

-TGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3071 

I I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I 

GT GAGGAAAAAT T AC CT GT CT T GACT GC C AT GT GT T CAT C AT CT TAAGT AT T GTAAGC T G 1020 

CT AT GT AT G GAT T TAAAT C GT AAT CAT AT TTGTTTTTCCT GT AT GAGGC ACT GGT GAAT A 3131 

I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I II I 

CT AT GT AT GGAT T T AAAC C GTAAT CAT A — TCTTTTTCCTATCTGAGGCACTGGTGGA — 1076 

AACAAAGAT CTGAGAAAGCT GTATATTACACTTTGTCGCAGGTAGT CTT GCT GTAT-TT G 3190 

| | Ml I I I I I I I I I II I I II I I I I I I I I I M I I I I I I Ml 
ATAAAAAACCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTG 1127 



GGGAATT GCAAAGAAAGT GGAGCT G AC AGAAAT AAC C CT T T T CAC AGT T T GT G 3243 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

GCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTG 1187 

CACT GT GT AC G GT CT GT GT AGGT T GAT GCAGAT T T T C T GAAAT GAAA TGTTTAGAC 3299 

| | | | M || I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I M I I I I 

CAC T GT GTAT GGT C C GT GT AGAT T GAT GCAGAT T T T CT GAAAT GAAAT GT T T GT T T AGAC 1247 

GAGAT CAT G C CAC CAAG G CAG GAGT GAAAAAGCT T GCCTTTCCT GGT AT GT T CT AGGT GT 3359 

I I I I I I I I II II I I I II I Ml II II II II I I I I II I I I I I M II I I 

GAGAT CAT AC C GGTAAAGCAGGAAT GACAAAGCT T G- CTT TT CT GGT AT GTT CT AGGT GT 1306 



Qy 

Db 



3360 AT T GT GAAAT T TACT GT T GT AT T AAT T GC CAAT ATAAGT AAAT AT AGAT TAT AT AT AT C T 3419 

I I I I I I I I I I I I I I I I I I I I II I 1 I I I I I I I II I I I I I II I I I I I I > I I I I M 
1307 AT T GT GACT T T TACT GT TAT AT T AAT T GC CAAT ATAAGT AAAT AT AGAT TAT AT AT G 1363 



Qy 3420 ATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGAT 3479 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1364 T AT AGT GT T T C ACAAAG C T T AGAC CT TT AC CT T - C C AGC C AC C C C ACAGT G CT T GAT 1419 

Qy 34 8 0 ACT TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAA 3533 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 142 0 AT T T C AGAGT C AGT CAT T GGT T AT AC AT GT GT AGT T C CAAAGC AC ATAAG CT AGAAGAAG 14 79 

Qy 3534 AC GT AC T T C TAGGC GCACT AC CAT CT GT T TT CAACAC GAAC C GAC GC CAT GCAAACAGAA 3593 

I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1480 AAAT AT T T C T AGGAGCACT AC CAT CT GT T TT CAAC AT GAA AT GC C ACACACAT AGAA 1536 

Qy 3594 CT C C - T CAACAT AAACT T C AC T G CAC AGACT TACT GT AGT T AAT T T TAT CAC AAACT C T - 3651 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I III 
Db 1537 C T C CAACAACAT CAATT T CAT T G CAC AGACT GACT GT AGT T AAT T T T GT C AC AGAAT C T A 1596 

Qy 3652 GGACTGAATCTAATGCTTCCAAAA AT GT T T GCAAAT AT CAAAC AT T GT TAT GT AAG 37 07 

I I I I I I I I I I II I I I I I I I I III M I I I I I I I I I I I M II 

Db 1597 T GGC T GAAT CT AAT GC T C CAAAAAT GT T GTT T GT T G CAAAT AC CAACAT T GT TAT GC C AG 1656 

Qy 3708 AAAATATA 3715 

III I II 

Db 1657 AAATTTTA 1664 
RESULT 3
AK034902
LOCUS       AK034902                3533 bp    mRNA    linear   HTC 18-SEP-2003
DEFINITION  Mus musculus 12 days embryo embryonic body between diaphragm region
            and neck cDNA, RIKEN full-length enriched library, clone:9430059L06
            product:RETICULON 4 (NEURITE OUTGROWTH INHIBITOR) (NOGO PROTEIN)
            (FOOCEN) (GLUT4 VESICLE 20 KDA PROTEIN) homolog [Rattus
            norvegicus], full insert sequence.
ACCESSION   AK034902
VERSION     AK034902.1  GI:26084268
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
  MEDLINE   99279253
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
  MEDLINE   20499374
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multi capillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
  MEDLINE   20530913
   PUBMED   11076861
REFERENCE   4
  AUTHORS   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   5
  AUTHORS   The FANTOM Consortium and the RIKEN Genome Exploration Research
            Group Phase I & II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6  (bases 1 to 3533)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama
            Kanagawa 230-0045, Japan (E-mail:genome-res@gsc.riken.go.jp,
            URL:http://genome.gsc.riken.go.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://genome.gsc.riken.go.jp/
            URL:http://fantom.gsc.riken.go.jp/.
FEATURES             Location/Qualifiers
     source          1..3533
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:9430059L06"
                     /db_xref="MGI:2399112"
                     /db_xref="taxon:10090"
                     /clone="9430059L06"
                     /tissue_type="embryonic body between diaphragm region and
                     neck"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="12 days embryo"
     misc_feature    1..3533
                     /note="RETICULON 4 (NEURITE OUTGROWTH INHIBITOR) (NOGO
                     PROTEIN) (FOOCEN) (GLUT4 VESICLE 20 KDA PROTEIN) homolog
                     [Rattus norvegicus] (SWISSPROT|Q9JK11, evidence: FASTY,
                     95.8%ID, 100%length, match=1068)"



ORIGIN 



  Query Match             20.2%;  Score 774.2;  DB 11;  Length 3533;
  Best Local Similarity   78.3%;  Pred. No. 6.6e-131;
  Matches 1190;  Conservative    0;  Mismatches  258;  Indels   71;  Gaps   19;



Qy       2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363
              |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db        740 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 799

Qy       2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423
              || ||||||||||| || |||||||| ||||||||||| |||||||||||||||||||||
Db        800 TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 859

Qy       2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483
              ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||
Db        860 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAAGCTATC 919

Qy       2484 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543
              |||||||| ||||||||||||||||||||||||||||||||||||||||||||| |||||
Db        920 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCCATATCA 979

Qy       2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603
              ||||| ||||||||||| || || |||||||||||||||||||| ||| |||||||||||
Db        980 GAGGAATTGGTTCAGAAATATAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAA 1039

Qy       2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663
              ||| | || || ||||||||||||||||||||||||||||| ||||||||||||||||||
Db       1040 GAATTGAGGCGTCTCTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 1099

Qy       2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723
              ||||| |||||||| || ||||||||||||||||||||| |||||||||| ||||| |||
Db       1100 ATGTGAGTATTTACTTACGTTGGTGCCTTGTTCAATGGTTTGACACTACTGATTTTAGCT 1159

Qy       2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783
              ||||| |||||||||||| |||||||||| |||||||||||||||||||| |||||||||
Db       1160 CTGATCTCACTCTTCAGTATTCCTGTTATATATGAACGGCATCAGGCGCAGATAGATCAT 1219

Qy       2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2843
              ||||| ||||||||||| ||||  ||||| ||||| ||||| ||||||||||||||||||
Db       1220 TATCTAGGACTTGCAAACAAGAGCGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATC 1279



Qy 2844 C C T G GAT T GAAG C GT AAAGCT GAAT GAGAAAGC CT GAAAGAGT T AACAAT AGAGGAGT T T 2903 

I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 
Db 1280 C C T G GAT T GAAGC G CAAAGC AGAAT GAAAAG GC C C CAAAC AGT AGAC A TTC 1330 

Qy 2904 AT CT T TAAAGG G GAT AT T CAT T T GAT T C CAT T G - G G GAGG GT CAGG GAAGAACAAAGC C T 2962 

I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II I I I III I I 
Db 1331 ATCTTTAAAGGGGACACTCCCTTGGTTACGGGGTGGGCGGGTCAGGGGTGAGCCCTGGGT 1390 

Qy 2963 T GAC AT T GC AGT GC AGT T T C AC AGAT CTT T AT T T T TAG CAAC GCAGT GT C T G AGGAAAAA 3022 

III I I I I I I I I I I M I I I I I II I I I II I I I I I I I I 

Db 1391 GGCCGTGCAGTTTCAG TTATTTTTAGCAGTGCACTGTTTGAGGAAAAA 1438 

Qy 3023 TGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGA 3082 

I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 1439 TTACCTGTCTTGACT-TCCTGTGTTTATCATCTTAAGTATTGTAAGCTGCTGTGTATGGA 14 97 

Qy 308 3 TTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCT 3142 

II I I I I I I I I I I I I I I I I I I I I I MM I I I I I I II I II 

Db 1498 TCT-CATTGTAGTCATACTTGTTTTCCCCAGATGAGGCACTTGGTGAATAAAGGATG-CT 1555 

Qy 3143 GAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGGAATTGCAAA 3202 

I I II I II II I III I I III I I I I I I I II II II I I I I I M I I I II I II M 

Db 1556 GGGAAAACTGTGTGTTATATTCTGTTGCAGGTAGTCTGGCTGTATTTGGAAAGTTGCAAA 1615 

Qy 3203 GAAAGTGGA GCTGACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCT 3258 

II I I I I I I I Mill I II II II I I I I I I I 

Db 1616 GAAGGTAGATTTGGGGGCAGGAAAAACAACCCTTTTCACAGTGTACTGTGTTTGGTTGGT 1675 

Qy 3259 GT GT AG GT T GAT GC AGAT T T T CT GAAAT GAAAT GTT T AGAC GAGAT CAT G C C AC C AAGG C 3318 

II I I I I I M I I I I I I I I I I I I I II II I I I I I I I I I I I I I 

Db 1676 GT AAAACT GAT GC AGAT T T TT CT GAAAT GAGAT GTT T AGAT GAG — CATACTACTAAAGC 1733 

Qy 3319 AGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTG 3378 

II I I I I I I I I I II II I I I M I II I I I I I I I I I I I I II I I 
Db 1734 AGAGT GGAAAAAT CT GT C — TTTATGGTATGTTCTAGGTGTATTGTGATTTACTGT 17 87 

Qy 337 9 TAT T AAT T G C C AAT AT AAGT AAAT AT AG AT TAT AT AT AT C TAT AT AT AGT GT T T C AC GAA 3438 

I I I I II I I I I II I M I I II I I II I I I I I II II I M I I I II I II II II 

Db 1788 — T AGAT T G C C AAT AT AAGT AAAT AT AGA CAT AAT C T AT AT AT AGT GT TTC AC AAA 1841 

Qy 3439 GCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACTTCTGTCATGGGTTTTA 34 98 

I I I I I I I M I I I I I I M I I II I II I II II I I I M I I II I I I I I I I I 

Db 1842 GCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTTGACCTCTGAGTCATTGGTTATA 19 01 

Qy 3499 TGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATC 3558 

II I I I I I I I I I I II I I I I M I I II I I I I I I I I I I I 

Db 1902 - C AGT GT AGT C C CAAGCACAT AAACT AGGAAGAGAAT GT AT T T GT AG GAGC G CT AC C AC C 1960 

Qy 3559 T GT T T T CAAC AC GAAC C GAC G C CAT G CAAAC AG AAC T C CT C AAC AT AAAC T T C AC T G C AC 3618 

I I I I I I I II I I I I I I I I II I II I I I M I M 

Db 1961 TGTTTTCAA GAGAACAT AGAAC T C CAAC GT AAC C GT CAT T T CAA 2004 

Qy 3619 AGAC T T AC T GT AGT T AAT T T TAT C AC AAACT CT GGACT GAAT C T AAT GC T T C CAAAAAT G 3678 

I I I I I I I M I I II II I I M I M I I I II I I I II I I II I M I I I 

Db 2005 AGAT T T AC T GT AT GT A TAGTTGATTTTGTGGACTGAATTTAATGCTTCC — AAATG 2 058 

Qy 3679 TTTGCAAATAT CAAACATTGTTAT GTAAGAAAATAT -AAATGACGATTTATACAATTGTG 3737 



Db 2059 T T T G C AGT T AC C AAAC AT T GT T AT G C AAG AAAT C AT AAAAT GAAGAC T TAT AC CAT T GT - 2117 

Qy 3738 GTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATC 37 97 

II I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2118 GTTTAAGCTGTATTGAATT — AT CT GT G GAAT GCAT T GT GAACT GT - AAAG CAAAGT AT C 2174 

Qy 379 8 AATAAAGCTTATAGACTTA 3816 

I I II I II I I I I I I I I I I II 
Db 2175 AATAAAGCTTATAGACTTA 2193 



RESULT 4 

AL573494/c
LOCUS       AL573494                 956 bp    mRNA    linear   EST 31-MAY-2003
DEFINITION  AL573494 Homo sapiens PLACENTA COT 25-NORMALIZED Homo sapiens cDNA
            clone CS0DI051YB11 3-PRIME, mRNA sequence.
ACCESSION   AL573494
VERSION     AL573494.2  GI:31294840
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 956)
  AUTHORS   Li,W.B., Gruber,C., Jessee,J. and Polayes,D.
  TITLE     Full-length cDNA libraries and normalization
  JOURNAL   Unpublished (2001)
COMMENT     On Feb 16, 2001 this sequence version replaced gi:12932794.
            Contact: Genoscope
            Genoscope - Centre National de Sequencage
            BP 191 91006 EVRY cedex - France
            Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr
            Library was constructed by Life Technologies, a division of
            Invitrogen. This sequence belongs to sequence cluster 1423.r. For
            more information about this cluster, see
            http://www.genoscope.cns.fr/
            cgi-bin/cluster.cgi?seq=CS0DI051CA06NP1&cluster=1423.r. Contact:
            Feng Liang Email: fliang@lifetech.com URL:
            http://fulllength.invitrogen.com/ InVitroGen Corporation 1600
            Faraday Avenue Genoscope sequence ID: CS0DI051CA06NP1.
FEATURES             Location/Qualifiers
     source          1..956
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="CS0DI051YB11"
                     /tissue_type="PLACENTA COT 25-NORMALIZED"
                     /clone_lib="Homo sapiens PLACENTA COT 25-NORMALIZED"
                     /note="1st strand cDNA was primed with a NotI-oligo(dT)
                     primer. Five prime end enriched, double-strand cDNA was
                     digested with Not I and cloned into the Not I and EcoR V
                     sites of the pCMVSPORT 6 vector. Library was normalized."



ORIGIN 



  Query Match             17.4%;  Score 668.6;  DB 9;  Length 956;
  Best Local Similarity   88.7%;  Pred. No. 1.5e-111;
  Matches  795;  Conservative    3;  Mismatches   67;  Indels   31;  Gaps    6;
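In the alignment for this result the Db coordinates count downward from 878, which is consistent with the "/c" suffix on the accession marking a match to the complementary strand of the database entry (an interpretation, not something the report states). A minimal Python sketch of reverse-complementing a sequence follows; the function name is hypothetical, and the table covers the IUPAC ambiguity codes so that a symbol such as R is handled.

```python
# IUPAC nucleotide codes and their complements (R <-> Y, K <-> M, and so on)
COMPLEMENT = str.maketrans("ACGTRYKMSWBDHVN", "TGCAYRMKSWVHDBN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, uppercased."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# A short fragment containing an ambiguity code
print(reverse_complement("TTRACA"))
```

Applying this to the stored database sequence would give the strand that is actually compared with the query in a complement match.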

Qy 2304 GT T GT T GAC C T C CT CT AC T GGAGAGACAT TAAGAAGAC T GG AGT GGTGTTTGGT GC C AG C 2363 

I I I I I I II I I I 1 I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 878 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 819

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I I I I I I I I I I I I II I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 818 CTATTCCTGCTGCTTTCATTRACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 759 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 758 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 699 

Qy 2484 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 
Db 698 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 639

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603

I I I I I I I I I I I I I I I II I I II I I II I I I II I I I I I I I I I I I II I I I I I I I I I I II I 
Db 638 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 579

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 578 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 519

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I II I I I II I I M I 

Db 518 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 459

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783

II I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I II I I I I I II I I 

Db 458 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 399

Qy 2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2843

Mill I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 398 TATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 339

Qy 2844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903

I I I I i I I I I II II I I II M I I I I I II I I I I I I I I I I II I II I I I I 

Db 33 8 C CT GGAT T GAAGC GCAAAGCT GAAT GAAAAC GC C CAAAATAATT A GTAGGAGTTC 284 

Qy 2904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2963

I I I I I I II I I II I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I 
Db 283 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA-CCTT 225

Qy 2964 GAC AT T GC AGT GC AGT T T CAC AGATCTTTATTTTTAGCAACGCAGTG-T 3011 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I 

Db 224 GACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 165

Qy 3012 CTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3071 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 164 GTGAGGAAAAATTACCTGTCTTTACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 105

Qy 3072 CTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATA 3131

I I I I I I I I I I I I I I I II 11:1111111 I I I I I I I II I I I I I I I I I I I I I I I 
Db 104 CTATGTATGGATTTAAACCGYAATCATA--TCTTTTTCCTATCTGAGGCACTGGTGGA-- 49



Qy 3132 AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT 3187 

I I III III I III I I I I I I I I I I I I I II II I : I 

Db 48 ATAAAAAACCTGNANATTNNNACTTGTCGCAGATAGTCTGGCCGCMT 2
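
The "Best Local Similarity" percentages in these summaries appear consistent with the number of matches divided by the total number of alignment columns (matches + conservative + mismatches + indels). For the record above, 795 / (795 + 3 + 67 + 31) = 795 / 896, or about 88.7%. That relationship is inferred from the reported figures, not stated by the search program, so the sketch below is only a sanity check for transcribed counts. The function names are hypothetical.

```python
import re

def parse_counts(line):
    """Extract the integer counts from a 'Matches ...; Gaps ...;' summary line."""
    pairs = re.findall(
        r"(Matches|Conservative|Mismatches|Indels|Gaps)\s+(\d+)", line
    )
    return {name.lower(): int(value) for name, value in pairs}

def local_similarity(counts):
    """Percent identity over aligned columns, rounded to one decimal place.

    Assumption (not stated in the report): the number of aligned columns is
    matches + conservative + mismatches + indels.
    """
    columns = (
        counts["matches"]
        + counts["conservative"]
        + counts["mismatches"]
        + counts["indels"]
    )
    return round(100.0 * counts["matches"] / columns, 1)

line = "Matches 795; Conservative 3; Mismatches 67; Indels 31; Gaps 6;"
print(local_similarity(parse_counts(line)))  # prints 88.7
```

A mismatch between the recomputed and the printed percentage would suggest that one of the counts was garbled in transcription.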
LOCUS       CB204418                 896 bp    mRNA    linear   EST 05-FEB-200
DEFINITION  AGENCOURT_11276017 NIH_MGC_135 Mus musculus cDNA clone
            IMAGE:30138586 5', mRNA sequence.
ACCESSION   CB204418
VERSION     CB204418.1  GI:28241848
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 896)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. David Rowe
            cDNA Library Preparation: Invitrogen Corp
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: NDAM0041 row: k column: 11
            High quality sequence stop: 686.
FEATURES             Location/Qualifiers
     source          1..896
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:30138586"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_135"
                     /note="Vector: pCMVSport 6.1; Site_1: EcoRV; Site_2: NotI;
                     Normalized full-length enriched library from pooled mouse
                     embryonic limb, maxilla and mandible, day 12.5, 13.5,
                     14.5, and 15.5 (size selected for the 0.5-1 kb fragments)
                     Cloned directionally, priming method: Oligo-dT. cDNA
                     enrichment: >1k bp, Average insert size 1.6k bp.
                     Normalization (Cot value): 7.5 kb. Priming sequence:
                     5' GACTAGTTCTAGATCGCGAGCGGCCGCCC (T) 3' Tissue contributed
                     by, David Rowe. Library constructed by ResGen, Invitrogen
                     Corp."
ORIGIN



Query Match 16.9%; Score 646.8; DB 14; Length 896; 

Best Local Similarity 85.3%; Pred. No. 1.4e-107; 

Matches 761; Conservative 0; Mismatches 117; Indels 14; Gaps 3;



Qy 1856 AAGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGAC 1915

I II I I I I I I I I I I I I II I I I I I M I I II I I I I I I I II I II I I I I II II 

Db 1 ATGAAAGAAGTGAACATTTTCCGATTCATCTCCCATTGAGATAATAGATGAGTTTCCCAC 60

Qy 1916 CTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGA 1975

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 AT T T GT C AGT GCTAAAGAT GAT T C T C C T AAGGAGT AC ACT GAC CT AGA 108 

Qy 1976 AGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGC 2035

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 109 AGTATCCAACAAAAGTGAAATTGCTAATGTCCAGAGCGGGGCCAATTCGTTGCCTTGCTC 168 

Qy 2036 AGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCC 2095

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I II I I I I I I I I II I 

Db 169 AGAATTGCCCTGTGACCTTTCTTTCAAGAATACATATCCTAAAGATGAAGCACATGTCTC 228

Qy 2096 AGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGT 2155

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 229 AGATGAATTCTCCAAAAGTAGGTCCAGTGTATCTAAGGTGCCCTTATTGCTTCCAAATGT 288

Qy 2156 TTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTTGTGAA 2215

II I I I I I I I I I I III! Ill II 11 I I I I I I I I I II I I I I I I I I I I III 

Db 289 TTCTGCTTTGGAATCTCAAATAGAAATGGGCAACATAGTTAAACCCAAAGTACTTACGAA 348

Qy 2216 AGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTAT 2275

I I I I I I I I I I I I I II I I I I I I I I M I II I I I I I I I I Mill I I III I 
Db 349 AGAAGCAGAGGAAAAACTTCCTTCTGATACAGAGAAAGAGGACAGATCCCTGACAGCTGT 408

Qy 2276 ATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAA 2335

III I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 409 ATTGTCAGCAGAGCTGAATAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAA 468

Qy 2336 GAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAG 2395 

I I I I I I I II 1 I I I I I I I I I I I I I I I I II I I M I I I I I I II I II MINIM I I I I I 
Db 469 GAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAG 528 

Qy 2396 CATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAG 2455 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I II I I I I II I I I 
Db 529 CATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAG 588

Qy 2456 GATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGC 2515

I II I II M I I I I I I I I I II I I I II I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I 
Db 589 GATATATAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGC 648

Qy 2516 ATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGC 2575

I II I I I II II I I I I I I I I I I I I I I I I I I I I I I I II I II II I I II II I I I I I I I I 

Db 649 ATATTTGGAATCTGAAGTTGCCATATCAGAGGAATTGGTTCAGAAATATAGTAATTCTGC 708

Qy 2576 TCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTT 2635

II M I I I I II I I III I I I I I I I I I I I I I I I II II I I I I I I I I I I II I I I I I I I I 

Db 709 TCTTGGTCATGTGAACAGCACAATAAAAGAATTGAGGCGTCTCTTCTTAGTTGATGATTT 768

Qy 2636 AGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC-TTGT 2694 

I II II I I I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I II I I I I I I I I I M I I 
Db 769 AGTTGATTCCCTGAAG-TTGCAGTGTTGATGTGGGTATTTACTTACGTTGGTGCCTTTGT 827

Qy 2695 TCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCC 2746



Db 828 TCAATGGTTTGACACTACTGATTTTAGCCCTGATCTCACTCTTCAGTATTCC 879
LOCUS       CK231727                1003 bp    mRNA    linear   EST 09-DEC-2003
DEFINITION  ILLUMIGEN_MCQ_2869 Katze_MMBR Macaca mulatta cDNA 5' similar to
            human RTN4 (Hs.436349), mRNA sequence.
ACCESSION   CK231727
VERSION     CK231727.1  GI:39638085
KEYWORDS    EST.
SOURCE      Macaca mulatta (rhesus monkey)
  ORGANISM  Macaca mulatta
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
            Cercopithecinae; Macaca.
REFERENCE   1  (bases 1 to 1003)
  AUTHORS   Katze,M.G., Thomas,M., Korth,M., Iadonato,S.P. and Magness,C.L.
  TITLE     Large-scale Rhesus Macaque cDNA Sequencing
  JOURNAL   Unpublished (2003)
COMMENT     Contact: C. Magness
            Illumigen Biosciences Inc.
            2203 Airport Way S, Suite 450, Seattle, WA 98134, USA
            Tel: 2063780400
            Fax: 2063780408
            Email: cmagness@illumigen.com
            Sequenced on 2003.11.24. 765 Q20 bases. Assembles in contig w/ 2
            member(s). Contig contains 1 (1.5%) lib members.
            PCR PRimers
            FORWARD: CCCTCACTAAAGGGAACAAAA
            BACKWARD: CACTATAGGGCGAATTGGGTA
            Insert Length: 1003 Std Error: 0.00
            Plate: CL000026 row: D column: 03
            Seq primer: CCCTCACTAAAGGGAACAAAA
            POLYA=No.
FEATURES             Location/Qualifiers
     source          1..1003
                     /organism="Macaca mulatta"
                     /mol_type="mRNA"
                     /strain="Indian"
                     /db_xref="taxon:9544"
                     /sex="female"
                     /dev_stage="adult"
                     /lab_host="E. coli SOLR"
                     /clone_lib="Katze_MMBR"
                     /note="Organ: brain; Vector: Uni-ZAP XR; Site_1: EcoR I;
                     Site_2: Xho I; Created from Stratagene ZAP-cDNA Synthesis
                     kit (catalog #200400) and ZAP-CDNA Gigapack III Gold
                     Cloning Kit (Catalog #200450)"
ORIGIN



Query Match 16.7%; Score 642; DB 14; Length 1003; 

Best Local Similarity 84.1%; Pred. No. 1e-106;

Matches 853; Conservative 0; Mismatches 125; Indels 36; Gaps 10; 



Qy 2469 GTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCT 2528



I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 9 GAGATCCAAGCTATCCAGAAATCAGATGAAGGCCCCCCATTCAGGGCATATCTGGAATCT 68

Qy 2529 GAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTT 2588

I I I I I II I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 69 GAAGTTGCGATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTG 128

Qy 2589 AACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTG 2648

II I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 129 AACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTG 188

Qy 2649 AAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACA 2708 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 189 AAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACG 248 

Qy 2709 CTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 2768 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I 

Db 249 CTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 308 

Qy 2769 GCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAA 2828

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 309 GCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAA 368

Qy 2829 ATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTA 2888

I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I II II III I III 
Db 369 ATCCAAGCGAAAATCCCTGGATTGAAGCGCAAAGCTGAATGAAAACACCCAAAATAATTA 428

Qy 2889 ACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGG 2948

I I I I I I I I 11 I II I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I II 
Db 429 CTAGGAGTTCATCTTTAAAGGGGGTATTCATTTGAGTATATGGGGGAGGGTCAGG 483

Qy 2 949 GAAGAACAAAGC CT T GAC AT T GC AGT GC AGT TT CAC AGAT CTTT ATT TT 2 997 

I I I II I I II I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I II 

Db 484 GAAGAACGAA-CCTTGACATTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTT 542

Qy 2998 TAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTT 3056

I I I I I III II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II II I 
Db 543 TAGCCATGCACTGTTGTGAGGGAAAATTACCTGTCTTGACTGCCATGTGTTCATCATC-T 601

Qy 3057 AAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATG 3116

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I II 
Db 602 AAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATA--TCTTTTTCCTATCTG 659

Qy 3117 AGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAG 3176

M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 660 AGGCACTGGTGGA AT AAAAAAC CT GT AT AT T T T ACT T T GT T G C AG GT AG 708 

Qy 3177 TCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCAC 3235

I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I 
Db 709 TCTTGCTGCATCTTGGCAAGTTGCAGAGATGGTGGAGTTAGAAAAAAAAGCCCTTTT--C 766

Qy 3236 AGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTT 3295 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I II I I I I I I I I I I I I 
Db 767 AGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTT 826

Qy 3296 AGACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAG 3355

I I I I I M I I I I I III M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M 



Db 827 AGACGAGATCATACCAGTAAAGCAGGAATGGAAAATCCTG-CTTTTCTGGTATGTTCTAG 885



Qy 3356 GTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATAT 3415

I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I II III I I I I I 

Db 886 GGGTATTGTGAATTTTACTGTTATATTAATTGCCCAATATAAGTAAATTATAAATTATAT 945

Qy 3416 ATCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCAC 3469 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 946 ATTTATAGTGGTTTCCCAAAACCTTAAAACCTTTTACCTTTCCAAGCCCCCCCC 999



RESULT 7 
CF110956 

LOCUS       CF110956                1101 bp    mRNA    linear   EST 23-JUL-2003
DEFINITION  Shultzomica04207 Rat lung airway and parenchyma cDNA libraries
            Rattus norvegicus cDNA clone Contig3732 5', mRNA sequence.
ACCESSION   CF110956
VERSION     CF110956.1  GI:33167337
KEYWORDS    EST.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (bases 1 to 1101)
  AUTHORS   Shultz,M.A., Zhang,L., Gu,Y.-Z., Baker,G.L., Fannuchi,M.V.,
            Padua,A.M., Gurske,W.A., Morin,D., Penn,S.G., Jovanovich,S.B.,
            Plopper,C.G. and Buckpitt,A.R.
  TITLE     Gene Expression Analysis in Response to Lung Toxicants: I.
            Sequencing and Microarray Development
  JOURNAL   Unpublished (2003)
COMMENT     Contact: Shultz MA
            Dept. of Molecular Biosciences, School of Veterinary Medicine
            University of California, Davis
            1311 Haring Hall, One Shields Avenue, Davis, CA 95616, USA
            Tel: 530 752 0793
            Fax: 530 752 4698
            Email: mashultz@ucdavis.edu
            Average Phred score is 20 or better. All poor quality data (Phred <
            20) and vector/linker sequence has been removed.
            High quality sequence stop: 1101.
FEATURES             Location/Qualifiers
     source          1..1101
                     /organism="Rattus norvegicus"
                     /mol_type="mRNA"
                     /strain="Sprague-Dawley"
                     /db_xref="taxon:10116"
                     /clone="Contig3732"
                     /sex="male"
                     /tissue_type="airway or parenchyma"
                     /dev_stage="adult"
                     /clone_lib="Rat lung airway and parenchyma cDNA libraries"
                     /note="Organ: lung; Vector: pGEM-11Zf(-); Site_1: Eco RI;
                     Site_2: Not I; mRNA was isolated from microdissected rat
                     lung airways and parenchyma tissues."
ORIGIN



Query Match 16.5%; Score 634.2; DB 14; Length 1101; 

Best Local Similarity 80.4%; Pred. No. 2.6e-105; 

Matches 913; Conservative 0; Mismatches 183; Indels 39; Gaps 13; 

Qy 2347 TGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTG 2406

I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I II I I I I I I I I I I I I I I I I I I 
Db 1 TGGTGTTTGGTG-CAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTG 59 

Qy 2407 TAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGG 2466

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 60 TAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGG 119

Qy 2467 GTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAAT 2526

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I i I I I I I I II I I I II I I I I 

Db 120 GCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAAT 179

Qy 2527 CTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATG 2586

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II Mill I I II I I I I I I I I I I I I I I I 

Db 180 CTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATG 239

Qy 2587 TTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTC 2646

I III I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 240 TGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCC 299

Qy 2647 TGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGA 2706 

I I I I I I I I I I II I I I I II I I I I I I I II I I I II I I I I II I I I I I II I I I I I I I I II I I I 
Db 300 TGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGA 359 

Qy 2707 CACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATC 2766

I I I II M I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 

Db 360 CACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATC 419 

Qy 2767 AGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTA 2826

III III I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 420 AGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCA 479

Qy 2827 AAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGT 2886

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 

Db 480 AAAT C CAAGCAAAAAT CCCT GGAT T GAAGC GCAAAGCAGATT GAAAAAGCC 530 

Qy 2887 TAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCA 2946

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 531 C CAAACAGAAGT T C AT CT T TAAAGG G GACACT CAC T T GAT T AC GGGGGTGGGA 583 

Qy 2947 GGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCACAGATCTTTATTTTTAGCAACGC 3006

III I I I I I I I I II I I I I I I II I I II I I I I I I I I II 

Db 584 GGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT--CAGCTCTTTATTTTTAGCAGTGC 641

Qy 3007 AGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTA 3066 

I III II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 642 ACTGTTTGAGGAAAAATTACCTGTCTTGACTT-CCTGTGTTTATCATCTTAAGTATTGTA 700

Qy 3067 AGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGT 3126

I I I I II I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I 

Db 701 AGCTGCTGTGTATGGATCT-CATTGTAGTCACACTTGTCTTCCCCAATGAGGCGCCTGGT 759

Qy 3127 GAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTA 3186



Db 760 GAATAAAGGA--CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCTAGCTGTA 817

Qy 3187 T T T GGGGAAT T G CAAAGAAAGT G GAGC T GAC AGAAAT AAC C C T T T T C AC AGT T T GT G 3243 

I I I I I I I I I I I II I III I I I I I I II II II I I I I I II III 
Db 818 TGCAGAGAGTTGTAAAGAAGGCAAATCTGGGGGCAGGGAAAACCCTTTTCACAG--TGTA 875

Qy 3244 CACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACGAGA 3303

I I I I I III I I I I II I I I II I I I I I I I I II I I I I M I I 

Db 876 CTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTTCTGAAATGAAATGTTTAGATGAGA 935

Qy 3304 TCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTG 3363

I I I I I I I I M I I I I I II III II I I I I I I I II I I I I I I I I I I I 

Db 936 GCATACTACTAAAGCAGAGTGGAAAACTCTGTC--TTTATGGTGTGTTCTAGGTGTATTG 993

Qy 3364 TGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATAT 3423

I I I II I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I II I I I 

Db 994 TG-AATTTACTGTTAT AT T GC CAAT AT AAGT AAAT AT AGA CCTAATCTATAT 1044 

Qy 3424 ATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGA 3478

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I || I I || I I 

Db 1045 ATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTTGA 1099



RESULT 8 
BQ719894 
LOCUS       BQ719894                 871 bp    mRNA    linear   EST 16-JUL-2002
DEFINITION  AGENCOURT_8241298 Lupski_sympathetic_trunk Homo sapiens cDNA clone
            IMAGE:6187240 5', mRNA sequence.
ACCESSION   BQ719894
VERSION     BQ719894.1  GI:21858791
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 871)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. James R. Lupski
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Agencourt Bioscience Corporation
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM13580 row: o column: 17
            High quality sequence stop: 677.
FEATURES             Location/Qualifiers
     source          1..871
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:6187240"
                     /sex="male"
                     /tissue_type="sympathetic trunk"
                     /dev_stage="adult, 16 yr"
                     /lab_host="DH10B"
                     /clone_lib="Lupski_sympathetic_trunk"
                     /note="Vector: pCMV-SPORT6 (Life Technologies); Site_1:
                     NotI; Site_2: SalI; cDNA made by oligo-dT priming.
                     Directionally cloned using the following adaptors:
                     5'-TCGACCCACGCGTCCG-3' and
                     5'-GACTAGTTCTAGATCGCGAGCGGCCGCCCT(15)-3'. Size selected >
                     1 kb for average insert length 1.9 kb. This is a primary
                     library, non-amplified. Library constructed by Life
                     Technologies and donated by J. Lupski, M.D./Ph.D. (Baylor
                     College of Medicine); available through Life
                     Technologies."
ORIGIN



Query Match 16.5%; Score 631; DB 13; Length 871; 

Best Local Similarity 86.2%; Pred. No. 1e-104;

Matches 731; Conservative 0; Mismatches 110; Indels 7; Gaps 3; 

Qy 909 GAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAGGAAGCATGT 968

I III I I I I I I I I I I I I I I I I I I I II II I I II I t I I II I I I I I II I I I I I 

Db 11 GCGGACGCGTGGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAGGAAGCATGT 70

Qy 969 GAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAAATGGACCTG 1028

I II I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 
Db 71 GAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAAATGGACTTG 130

Qy 1029 GTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTTTGCCCATCT 1088

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I 1 I I I I I I 
Db 131 GTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTTTGCCCATCA 190

Qy 1089 TTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATGGAAGCACCA 1148

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I 

Db 191 TTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATGGAAGCACCA 250

Qy 1149 TTA7^ATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCATCACCATTA 1208 

II 11 I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 251 TTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCATCACCATTA 310 

Qy 1209 GAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAATCCCCCACCA 1268

III I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 311 GAAGC TT CT T CAGTTAATTAT GAAAGCATAAAACAT GAGC CT GAAAAC C C C C CAC CA 367 

Qy 1269 TATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAAGAAATCACA 1328

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II Mill: I I 
Db 368 TATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAAGAAATTAAA 427

Qy 1329 GAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATATCTATTGCA 1388

I I I I I II II I I I I I II I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I M I I 
Db 428 GAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTATATATCTATTGCA 487



Qy 1389 TGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCTCTAGTTAT 1448

I I I II I I II I I I I I I I I I I I I I I I I III I I I I I I I I II I I I I I I I I I I I I I I
Db 488 TGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTCTCTGATTAT 547



Qy 1449 TCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTTGAAGATTCC 1508

I I I I I I I I III I I I I I I I I II I I I I I I II II I II I I I I I I I I I I I I I II I I I I I I 

Db 548 TCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTTGAAGATTCC 607

Qy 1509 TCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAAGTTCCACAA 1568

II II I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I II I I I I I I II I I I I I I I I I 

Db 608 TCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGACGTTCCACAA 667

Qy 1569 AAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCATCTGAGTCA 1628

I II I I I I I I I I I I I I I I! I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 668 AAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCATTTGAGTCA 727 

Qy 1629 ATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCTGAGGGAGGAAAA 1688

I I I I I I I I I I I I I II 1 I I II I I I I I I I I I I I I I I I I I I I II I I II 
Db 72 8 AT GAT AGAAT AT GAAAAT AAGGAAAAACT CAGT GCT T T G C CACCTGAGGGGNAGAAG 784 

Qy 1689 CCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTAG-CACCTGA 1747

I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 785 NCATANTTGGAATCTTTTAAGCTCAGTTTAGATAACACNANAGATACCCTGGTTACCTGA 844

Qy 1748 TGAAGTTT 1755 

I I I I I II 
Db 845 TGAAGGTT 852



RESULT 9 
CB467517 
LOCUS       CB467517                 646 bp    mRNA    linear   EST 26-MAR-2003
DEFINITION  733269 MARC 6BOV Bos taurus cDNA 5', mRNA sequence.
ACCESSION   CB467517
VERSION     CB467517.1  GI:29273902
KEYWORDS    EST.
SOURCE      Bos taurus (cow)
  ORGANISM  Bos taurus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Bovinae; Bos.
REFERENCE   1  (bases 1 to 646)
  AUTHORS   Smith,T.P.L., Roberts,A.J., Echternkamp,S.E., Chitko-McKown,C.G.,
            Wray,J.E. and Keele,J.W.
  TITLE     A second set of bovine ESTs from pooled-tissue normalized libraries
  JOURNAL   Unpublished (2003)
COMMENT     Contact: Smith TPL
            USDA, ARS, US Meat Animal Research Center
            PO Box 166, Clay Center, NE 68933-0166, USA
            Tel: 402 762 4366
            Fax: 402 762 4390
            Email: smith@email.marc.usda.gov
            Single pass sequencing. Bases called with phred v0.020425.c and
            trimmed with the aid of the trim_alt option. Vector identified with
            crossmatch v0.990329.
            Plate: LAM8011 row: E column: 7
            Seq primer: GTAATACGACTCACTATAGGG.
FEATURES             Location/Qualifiers
     source          1..646
                     /organism="Bos taurus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9913"
                     /tissue_type="pooled"
                     /lab_host="DH10B"
                     /clone_lib="MARC 6BOV"
                     /note="Vector: pcDNA3.1; Site_1: EcoRI; Site_2: NotI;
                     Library made with RNA pooled from multiple tissues
                     including liver, lung, hypothalamus, pituitary, and
                     placenta/endometrium."
ORIGIN

Query Match 16.4%; Score 628.8; DB 14; Length 646; 

Best Local Similarity 99.4%; Pred. No. 2.9e-104; 

Matches 642; Conservative 0; Mismatches 2; Indels 2; Gaps 1; 

Qy 3177 TCTTGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACA 3236

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 TCTTGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACA 60

Qy 3237 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 3296 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 120

Qy 3297 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 3356

I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I | I I I I 
Db 121 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 180

Qy 3357 TGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGAT--TATATA 3414

I I I I I I I I I I I I I I M I I I I I I I I i I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 181 T GTATT GT GAAATTTACT GT T GTATTAATTGCCAATATAAGTAAATAT AGATT ATAT ATA 24 0 

Qy 3415 TAT C T AT AT AT AGT GT T T C AC GAAGCT T AGC C CT T T AC CT T C C C AG C T GC C C C AC AGT G C 3474 

I I I I I I I I I I I I 11 I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I 
Db 241 TATCTATATATAGTGTTTCATGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGC 3 00 

Qy 34 75 T T GAT AC T T CT GT C AT GGGT T T TAT GT GT GT AGT C C CAAAGC ACAT AAGCT AGGGAGAAA 3534 

I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I II I I I I I I I I M I I I I I I II 
Db 301 T T GAT ACT T C T GT CAT G GGT T T TAT GT GT GT AGT C C CAAAGC AC AT AAGCT AG GGAGAAA 360 

Qy 3535 C GT ACTT CT AGGC G CACT AC CAT CT GT T T T CAACAC GAAC C GACG CC AT G CAAACAGAAC 3594 

I I I I I I I I I I I I I I I II I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 361 C GTAC T T CT AG GCGC ACT AC CAT C T GT TT T CAACAC GAAC C GAC GCC AT GCAAAC AGAAC 420 

Qy 3595 TCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTCTGGA 3654 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTCTGGA 4 80 

Qy 3655 CT GAAT CT AAT GC T T C CAAAAAT GT T T G CAAAT AT CAAAC AT T GT TAT GT AAGAAAAT AT 3714 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 CT GAAT CT AAT GCT T C CAAAAAT GTT T GCAAATAT CAAAC AT T GT T AT GT AAGAAAAT AT 54 0 

Qy 3715 AAAT GAC GAT T T AT ACAAT T GT GGT T T AAG C T GT AT T GAACT AAAT CT GT G GAAT GC AT T 3774 

I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 541 AAAT GAC GAT T TAT ACAAT T GT G GT T T AAG CT GT AT T GAACT AAAT C T GT GGAAT GC AT T 600 

Qy 3775 GT GAAC T GT AAAAG C AAAGT AT C AAT AAAGC T T AT AGACT T AAAAA 3820 

I I II I II II I I I I I I I I I I II I I I I I I I I I I I M II I I I I II I I I 
Db 601 GT GAAC T GT AAAAG C AAAGT AT CAAT AAAGC T T AT AGAC C T AAAAA 64 6 



RESULT 10 

BM986175 

LOCUS       BM986175 627 bp mRNA linear EST 22-MAR-2002
DEFINITION  LM24HW00149 Bos taurus LM-24-HW cDNA library Bos taurus cDNA clone
            LM-24-HW-001-49 (5'), mRNA sequence.
ACCESSION   BM986175
VERSION     BM986175.1 GI:19684841
KEYWORDS    EST.
SOURCE      Bos taurus (cow)
  ORGANISM  Bos taurus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Bovinae; Bos.
REFERENCE   1 (bases 1 to 627)
  AUTHORS   Yoon,D.H., Jang,Y.S., Kim,T.H., Park,E.W., Lee,H.K., Chung,E.R.,
            Sun,S.S. and Cheong,I.C.
  TITLE     Gene Expression Profiling of the Bovine skeletal muscle
  JOURNAL   Unpublished (2002)
COMMENT     Contact: Dr. Du-Hak Yoon
            National Livestock Research Institute, RDA
            564 Omockchun-dong, Suwon, 441-350, Korea
            Tel: 82 31 290 1593
            Fax: 82 31 290 1792
            Email: dhyoon@rda.go.kr
            Insert Length: 839 Std Error: 0.00
            Seq primer: CAGGAAACAGCTATGAC
            POLYA=No.
FEATURES             Location/Qualifiers
     source          1..627
                     /organism="Bos taurus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9913"
                     /clone="LM-24-HW-001-49 (5')"
                     /sex="six males mixed"
                     /tissue_type="longissimus dorsi"
                     /cell_type="myocyte"
                     /dev_stage="24 months old"
                     /lab_host="XL1-BlueMRF' strain"
                     /clone_lib="Bos taurus LM-24-HW cDNA library"
                     /note="Organ: skeletal muscle; Vector: Uni-ZAPXR; Site_1:
                     EcoRI; Site_2: Xho I"
ORIGIN



Query Match 16.3%; Score 625.4; DB 12; Length 627;

Best Local Similarity 99.8%; Pred. No. 1.2e-103;

Matches 626; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy 2766 C AGGC GCAAAT AGAT CAT TAT CT GG GACT T G CAAAT AAGAAT GT T AAAGAT G C TAT GG C T 2825 

II I I I I I I I I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 1 I I I 11 I I I I I 
Db 1 C AGGC GCAAAT AGAT CAT TAT CT GG GACT T G CAAAT AAGAAT GTTAAAGAT GC T AT GGC T 60 



Qy 2 826 AAAAT C CAAGCAAAAAT C C CT GGAT T GAAGC GT AAAG C T GAAT GAGAAAG C C T GAAAGAG 2 8 85 

I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I I I M 
Db 61 AAAAT C CAAGCAAAAAT C C CT G GAT T GAAGC GT AAAG CT GAAT GAGAAAGC C T GAAAGAG 120 



Qy 288 6 T T AACAAT AGAGGAGT T TAT CT T TAAAGG G GAT AT T C AT TT G AT T C CAT T GGGGAG GGT C 2 94 5 

I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I 
Db 121 T T AACAAT AGAGGAGT TT AT C T T T AAAG G GGAT AT T CAT TT GAT T C CAT T G GG GAGG GT C 18 0 

Qy 2946 AG GGAAGAACAAAG C C TT GACAT T GC AGT GC AGT T T C AC AGAT CT T TAT T T T T AGCAAC G 3005 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 181 AG G GAAGAACAAAGC CT T GACAT T GC AGT G C AGT T T C AC AGAT C T T TAT T T T TAG CAAC G 24 0 

Qy 3006 CAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGT 3065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 241 C AGT GT CT GAG GAAAAAT GAC CT GT CT T GAC T G C C C T GT GT T CAT CAT C T T AAGT ATT GT 300 

Qy 3066 AAGCT GCT AT GT AT GGAT T T AAAT C GT AAT CAT AT TTGTTTTTCCT GT AT GAGGC ACT G G 3125 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 301 AAG CT G CT AT GT AT GGAT T TAAAT C GT AAT CAT AT T T GT T T TT C CT GT AT GAG GC ACT GG 3 60 

Qy 312 6 T GAATAAACAAAGAT CT GAGAAAG CT GT AT AT T AC ACT T T GT C GCAG GT AGT CTTGCTGT 318 5 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 361 T GAATAAACAAAGATCTGAGAAAGCT GTAT ATT ACACTTTGTC GCAGGTAGT CTT GCT GT 42 0 

Qy 318 6 AT T T G GGGAAT T G CAAAGAAAGT GGAG C T GAC AGAAATAAC CCTT TT C ACAGT T T GT GC A 3245 

I I I I I I I I I I I I I I I M i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 421 AT T T G GGGAAT T G CAAAGAAAGT GGAGCT GAC AGAAATAAC CCTT TT C ACAGT T T GT GC A 4 80 

Qy 324 6 CT GT GT AC GGT CT GT GT AG GT T GAT GC AGAT T T T CT GAAAT GAAAT GT T T AGAC GAGAT C 3305 

I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 481 CT GT GT AC GGT CT GT GT AGGT T GAT GCAGAT T T T C T GAAAT GAAAT GT TT AGAC GAGAT C 54 0 

Qy 3306 ATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTG 3365 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 541 ATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTG 600 

Qy 3366 AAATT TACT GTT GTAT TAATTGCCAAT 3392 

III I I I I I I I I I I I M I I I I M I I I I 

Db 601 AAAAT TACT GT T GT ATTAAT T GC CAAT 627 



RESULT 11 

BG570231 

LOCUS       BG570231 843 bp mRNA linear EST 10-APR-2001
DEFINITION  602590632F1 NIH_MGC_77 Homo sapiens cDNA clone IMAGE:4717662 5',
            mRNA sequence.
ACCESSION   BG570231
VERSION     BG570231.1 GI:13577884
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 843)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH Laboratories, Inc.
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLCM1567 row: o column: 07
            High quality sequence stop: 801.
FEATURES             Location/Qualifiers
     source          1..843
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4717662"
                     /lab_host="DH10B (T1 phage-resistant)"
                     /clone_lib="NIH_MGC_77"
                     /note="Organ: lung; Vector: pDNR-LIB (Clontech); Site_1:
                     SfiI (ggccgcctcggcc); Site_2: SfiI (ggccattatggcc); 5' and
                     3' adaptors were used in cloning as follows: 5' adaptor
                     sequence: 5'-CACGGCCATTATGGCC-3' and 3' adaptor sequence:
                     5'-ATTCTAGAGGCCGAGGCGGCCGACATG-dT(30)BN-3' (where B = A,
                     C, or G and N = A, C, G, or T). Average insert size 1.9
                     kb (range 0.5-4.0 kb). 12/15 colonies contained inserts
                     by PCR. This library was enriched for full-length clones
                     and was constructed by Clontech Laboratories (Palo Alto,
                     CA). Note: this is a NIH_MGC Library."
ORIGIN

Query Match 16.2%; Score 621.4; DB 12; Length 843; 

Best Local Similarity 87.6%; Pred. No. 5.9e-103; 

Matches 742; Conservative 0; Mismatches 86; Indels 19; Gaps 5; 

Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 GT T GT T GAC C T C C T GT ACT GGAGAGACAT T AAGAAGACT GGAGT G GT GT T T GGT GC C AGC 61 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 62 CT AT TCCTGCTGCTTT C ATT GAC AGT AT T C AGC AT T GT GAG C GTAAC AGC C T ACAT T GC C 121 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 

I I I I I I I I I I I I I I I I I I I I I I I I II i I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I 
Db 122 TTGGCCCTGCTCTCT GT GAC CAT C AGC T T TAG GAT AT ACAAG GGT GT GAT C C AAG CT AT C 181 

Qy 2484 CAGAAAT CT GAT GAAGGC C AC C CAT T CAGG G CAT AT T T G GAAT CT GAAGT T GCT ATAT CT 2543 

I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I II I I I I I I I I I III II I 

Db 182 CAGAAAT CAGAT GAAG GC C AC C CAT T C AGGG CAT AT C T GGAAT CT GAAGT T GC TAT AT CT 241 

Qy 2544 GAG GAGT T G GT T C AGAAGT AC AGCAAT TCTGCTCTTGGT CAT GT TAACT G C ACAAT AAAA 2603 

II I I I I I I I II I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 242 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 301 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

I I I I I I I I I 1 I I I 1 I I I I I I I I I I II I I I I I I II I I I I I I || I | I I I I I I I || I M I I I 

Db 302 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 361 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 



Db 362 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 421



Qy 2724 CT GAT T T C AC T CT T C AGT GT T C C T GT TAT T TAT GAAC G GC AT C AG G C G C AAAT AGAT CAT 2783 

II I I I I I II I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I II I I I I I I I I I I I I I 
Db 422 C T CAT T T C ACT CT T C AGT GTTCCTGT TAT T TAT GAAC GG CAT CAG G C AC AGAT AGAT CAT 4 81 

Qy 278 4 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2 84 3 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 4 82 TATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 541 

Qy 2844 C CT GGAT T GAAG C GT AAAGC T GAAT GAGAAAGC C T GAAAG AGT T AACAAT AGAG GAGT T T 2903 

I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Nihil 
Db 542 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 596 

Qy 2904 AT CT T TAAAG G GGAT ATT CAT T T GAT T C CAT T G G GGAG GGT C AGG GAAGAACAAAGC CT T 2963 

I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I II I I 
Db 597 AT CT T TAAAG G G GAT AT C CAT T T GAT T AT AC GG G G GAGGGT C CAG GAAGAAC GAAC C T T G 656 

Qy 2 964 GACATT GCAGT GCAGTTT CACAGAT CTTTATTTTTAGCAACGCAGTG-T 3011 

I I I I I I I I I I I I II I I I II I I I I I I I I I I II I I I III II I 

Db 657 ACGTTTGCAGTGCAGTTT CACAGAT CGT AGT AGAT CCTTTATTTTTAGCCCTGCACTGTT 716 

Qy 3012 CT GAG GAAAAAT GACC T GT CT T GACT - GC C CT GT GT T CAT CAT CT TAA- GT AT T GTAAG C 3069 

II I I II I I I I I I I I I I II I M III III I I I I II I I I I I I M I I I I I I I I I 
Db 717 GT G C AGGAAAAT T ACC T GT CT T GACT G GC CAT GT T GT CAT CAT CT T AAGGT AT T GTAAG C 77 6 

Qy 307 0 T GCT AT GT AT GGATT TAAAT C GT AAT CAT AT T T GT T T T T C CT GT AT GAGGC ACT GGT GAA 312 9 

I I I I I I I I I I I I I I I I I M I I I I III I II I I I I I Ml 

Db 7 77 TGCTATGTATGGATTTAAACCGTATCATATCTTTTCCTATCTAATCTGAGGACTGTGGAA 836 

Qy 3130 TAAACAA 3136 

I I I I I 

Db 837 TTAAAAA 843 



RESULT 12 

CF768055 

LOCUS       CF768055 619 bp mRNA linear EST 17-OCT-2003
DEFINITION  CES004952 Bos taurus skin cDNA library Bos taurus cDNA clone
            CCL004952 5', mRNA sequence.
ACCESSION   CF768055
VERSION     CF768055.1 GI:37717274
KEYWORDS    EST.
SOURCE      Bos taurus (cow)
  ORGANISM  Bos taurus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Bovinae; Bos.
REFERENCE   1 (bases 1 to 619)
  AUTHORS   Wang,Y.H., McWilliam,S. and Lehnert,S.
  TITLE     Transcription profiling of cattle skin
  JOURNAL   Unpublished (2003)
COMMENT     Contact: Dr Yonghong Wang
            Functional Genomics Lab
            CSIRO Livestock Industries
            Level 5, Queensland Biosciences Precinct, University of Queensland,
            306 Carmody Road St. Lucia QLD Australia
            Tel: 07 3214 2445
            Fax: 07 3214 2685
            Email: Yonghong.Wang@csiro.au
            Plate: 6 row: D column: 10.
FEATURES             Location/Qualifiers
     source          1..619
                     /organism="Bos taurus"
                     /mol_type="mRNA"
                     /strain="Hereford Shorthorn"
                     /db_xref="taxon:9913"
                     /clone="CCL004952"
                     /sex="female"
                     /tissue_type="pooled"
                     /dev_stage="Adult"
                     /lab_host="XL1-BlueMRF' strain"
                     /clone_lib="Bos taurus skin cDNA library"
                     /note="Organ: skin; Vector: Uni-ZAPXR; Site_1: EcoRI;
                     Site_2: Xho I; Library made from pooled skin of adult
                     female Hereford-Shorthorn."
ORIGIN

Query Match 16.1%; Score 619; DB 14; Length 619; 

Best Local Similarity 100.0%; Pred. No. 1.7e-102; 

Matches 619; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2507 ATT CAGGGCATATT TGGAAT CT GAAGTT GCTATAT CT GAGGAGTT GGTT CAGAAGTACAG 2566 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 
Db 1 AT T CAGGGC AT ATT T GGAAT CT GAAGT T GC T AT AT C T GAGGAGT T GGT T CAGAAGTACAG 60 

Qy 2567 CAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGT 2626 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I M I I I I II I I I I I I I I 
Db 61 CAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGT 120 

Qy 2 627 T GAT GAT T T AGT T GAT T CT C T GAAGT T T GCAGT GT T GAT GT GG GT AT T T AC CT AT GT T GG 268 6 

I II M I I I II I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 121 T GAT GAT TT AGT T GAT T CT CT GAAGT T T G C AGT GT T GAT GT GGGT AT T T AC CT AT GT T G G 180 

Qy 2687 TGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCC 274 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II II 
Db 181 TGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCC 240 

Qy 2747 T GT TAT T TAT GAAC GG CAT C AG G C G C AAAT AG AT CAT TAT C T G G G AC T T G C AAAT AAGAA 2806 

I I I I I II I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 T GT TAT T TAT GAAC G G CAT C AGG C G C AAAT AGAT CAT TAT CT G G GAC T T GC AAAT AAGAA 300 

Qy 2807 TGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGA 2866

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M 

Db 301 T GT TAAAGAT GC T ATG GCTAAAAT C CAAG C AAAAAT C C C T GGAT T GAAG C GT AAAG CT GA 360 

Qy 2 867 ATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTT 2926 

M II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTT 420 

Qy 2927 GAT T C CAT T G GGGAGGGT C AG G GAAGAACAAAGC CT T GAC AT T GCAGT GCAGT T T C AC AG 2 986 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I 

Db 421 GATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCACAG 480 



Qy 2987 ATCTTTATTTTTAGCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGT 304 6 

I I I I M I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 481 ATCTTTATTTTTAGCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGT 54 0 

Qy 3047 T CAT C AT CTTAAGT AT T GTAAG CT GC T AT GT AT GGAT T TAAAT C GTAAT CAT AT T T GT T T 3106 

I I I II I II I M I I I I II I I I I II I M I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I 

Db 541 T CAT CAT C TT AAGT AT T GTAAG C T GC T AT GT AT GGAT TT AAAT C GTAAT CAT AT T T GT T T 600 

Qy 3107 TTCCTGTATGAGGCACTGG 3125 

II I I I I I I I I I II I I I I I I 

Db 601 TTCCTGTATGAGGCACTGG 619 



RESULT 13 

CK232009 

LOCUS       CK232009 1164 bp mRNA linear EST 09-DEC-2003
DEFINITION  ILLUMIGEN_MCQ_3196 Katze_MMPL2 Macaca mulatta cDNA 5' similar to
            human RTN4 (Hs.436349), mRNA sequence.
ACCESSION   CK232009
VERSION     CK232009.1 GI:39638367
KEYWORDS    EST.
SOURCE      Macaca mulatta (rhesus monkey)
  ORGANISM  Macaca mulatta
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
            Cercopithecinae; Macaca.
REFERENCE   1 (bases 1 to 1164)
  AUTHORS   Katze,M.G., Thomas,M., Korth,M., Iadonato,S.P. and Magness,C.L.
  TITLE     Large-scale Rhesus Macaque cDNA Sequencing
  JOURNAL   Unpublished (2003)
COMMENT     Contact: C. Magness
            Illumigen Biosciences Inc.
            2203 Airport Way S, Suite 450, Seattle, WA 98134, USA
            Tel: 2063780400
            Fax: 2063780408
            Email: cmagness@illumigen.com
            Sequenced on 2003.11.25. 797 Q20 bases. Assembles in contig w/ 2
            member(s). Contig contains 1 (0%) lib members.
            PCR PRimers
            FORWARD: CCCTCACTAAAGGGAACAAAA
            BACKWARD: CACTATAGGGCGAATTGGGTA
            Insert Length: 1164 Std Error: 0.00
            Plate: CL000027 row: F column: 08
            Seq primer: CCCTCACTAAAGGGAACAAAA
            POLYA=No.
FEATURES             Location/Qualifiers
     source          1..1164
                     /organism="Macaca mulatta"
                     /mol_type="mRNA"
                     /strain="Indian"
                     /db_xref="taxon:9544"
                     /sex="male"
                     /dev_stage="newborn infant"
                     /lab_host="E. coli SOLR"
                     /clone_lib="Katze_MMPL2"
                     /note="Organ: placenta; Vector: Uni-ZAP XR; Site_1: EcoR
                     I; Site_2: Xho I; Created from Stratagene ZAP-cDNA
                     Synthesis kit (catalog #200400) and ZAP-CDNA Gigapack III
                     Gold Cloning Kit (Catalog #200450)"
ORIGIN

Query Match 16.1%; Score 618.6; DB 14; Length 1164; 

Best Local Similarity 83.9%; Pred. No. 1.7e-102; 

Matches 890; Conservative 0; Mismatches 124; Indels 47; Gaps 15; 

Qy 2428 CCCTGCTCTCTGT GAC TAT CAG C T T T AGGAT AT AT AAG G GT GT GAT C C AGG C TAT C C AGA 2487 

I I Ml I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I II I I I I 

Db 48 CGCGGCTGCAGGTGACCATCAGCTTTAGGCT-TACAGGGGTGTGATCCAAGCTATCCAGA 106 

Qy 24 88 AATCT GAT GAAGGC C AC C CAT T CAG GG CAT AT T T GGAAT CT GAAGTT G C T AT AT CT GAGG 2547 

I I M I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 107 AATCAGAT GAAGGCCACC CATT CAGGGCATAT CT GGAAT CT GAAGTT GCGAT AT CT GAGG 166 

Qy 2548 AGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAAC 2607 

I I I I I I I II I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 167 AGTT GGT T C AGAAGT AC AGT AAT TCTGCTCTTGGT CAT GT GAAC T GC AC GATAAAG GAAC 22 6 

Qy 2608 T C AGAC G C C T CTT CT T AGT T GAT GAT T T AGT T GAT T C T CT GAAGT T T GC AGT GT T GAT GT 2667 

I I I I I I I II I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 227 TCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGT 28 6 

Qy 2668 GGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGA 2727 

I I I I I I I I I I I I I I I I I I I II I I I I I II I I I II I I I I I I I I I II I I I I I I I I I II I 
Db 2 87 GGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACGCTACTGATTTTGGCTCTCA 346 

Qy 2728 TT T CACT CTT CAGT GT T C CT GT TAT T TAT GAAC - GGC AT C AGGC G CAAAT AGAT CATT AT 2786 

I I I I I II I I I I I I I I I I I I I I I I I I I II I II II I I I I II I I I I II I I I I I M I I I I I 

Db 347 T T T CACT CTT CAGT GTT C C T GT TAT T TAT GAACN GG C AT CAG G C AC AGAT AGAT CATT AT 4 06 

Qy 27 87 C T GGGAC T T GCAAATAAGAAT GT T AAAGAT GCT AT GGC T AAAAT C CAAGCAAAAAT CC CT 2 8 46 

II I M I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I MINIMI 

Db 4 07 CT AGGACT T G CAAAT AAGAAT GT TAAAGAT G CT AT G G CTAAAAT C CAAGC GAAAAT CC C T 4 66 

Qy 284 7 G GAT T GAAG C GTAAAGC T GAAT GAGAAAGCCT GAAAGAGT TAAC AAT AGAG GAGT TT AT C 2 9 06 

I I I I I I I I I II I II I I I I I I I I I I I II II I I I I I I I I I I I I I I I 

Db 4 67 GGAT T GAAGC GCAAAGCT GAAT GAAAAC AC C C AAAAT AAT T A CTAGGAGT T CAT C 521 

Qy 2907 T TTAAAG G G GAT ATT CAT T T GATT C CAT T GGG GAG GGT C AG GGAAGAACAAAG C CT TGAC 2966 

I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I II I I I I I I I 
Db 522 T T T AAAG G GG GT ATT C AT T T GAGT AT AT GGG GGAG GGT CAGG GAAGAAC GAA- C C T T GAC 580 

Qy 2967 ATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTG-TCTG 3014 

I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I III II I II 

Db 581 ATTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTG 64 0 

Qy 3015 AGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTA 3074 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 641 AGGGAAAATTACCTGTCTTGACTGCCATGTGTTCATCATC-TAAGTATTGTAAGCTGCTA 699 

Qy 3075 T GT AT GGAT T T AAAT C GT AAT CAT AT TTGTTTTTCCT GT AT GAG G C AC T GGT GAATAAAC 3134 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I 
Db 7 00 T GT AT GGAT T T AAAC C GT AAT CAT A — TCTTTTTCCTATCTGAGGCACTGGTGGA 752 

Qy 3135 AAAGAT CTGAGAAAGCTGT AT ATT ACACTTTGTCGCAGGT AGT CTT GCT GT AT -TTGGGG 3193 



I I III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

Db 753 ATAAAAAACCTGTATATTTTACTTTGTTGCAGGTAGTCTTGCTGCATCTTGGCA 8 06 

Qy 3194 AATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTAC 3253 

I II II I I II II I I I I I I III I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 807 AGTTGCAGAGATGGTGGAGTTAGAAAAAAAAGCCCTTTT--CAGTTTGTGCACTGTGTAT 8 64 

Qy 3254 G GT - CT GT GT AGGT T GAT GC AGAT T T T CT GAAAT GAAAT GT T T AGAC GAGAT CAT G C C AC 3312 

III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I || 
Db 865 GGT CC C GT GTAGATT GATGCAGATT T T CT GAAAT GAAAT GTTTAGACAAGAT CAT ACC CA 924 

Qy 3313 CAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTA 3372 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | II 

Db 925 GTAAAGCAGGAATGAAAAATCTTGCTTTTCTGGTATGTTCTAGGGGTATTGGGACTTTTA 98 4 

Qy 3373 CT GT T GT ATT AAT T GC CAAT - AT AAGTAAAT AT AGAT TAT AT AT AT C T AT AT AT AGT GT T 3431 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I II 

Db 98 5 C T GGT AT AT T AAT T G C C AAT AAT AAGT AAAT AT AAAT T A AT ATAAT AT AAGT GT T 1039 

Qy 3432 T C AC GAAGCT T AGCCCTTTACCTTCCCAGCTGCCCCAC 3469 

I I I II I I I I I I I I I I I I I I I I I I I Mill 
Db 104 0 TCCCAAAACTTTAAAACCTTTTACCTTTCCAGCCCCCCCCC 108 0 
LOCUS       BF044381 631 bp mRNA linear EST 10-OCT-2000
DEFINITION  BP250021B10D3 Soares normalized bovine placenta Bos taurus cDNA
            clone BP250021B10D3 5', mRNA sequence.
ACCESSION   BF044381
VERSION     BF044381.1 GI:10761508
KEYWORDS    EST.
SOURCE      Bos taurus (cow)
  ORGANISM  Bos taurus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
            Bovidae; Bovinae; Bos.
REFERENCE   1 (bases 1 to 631)
  AUTHORS   Lewin,H.A., Soares,M.B., Rebeiz,M., Pardinas,J., Liu,L. and
            Larson,J.H.
  TITLE     Bovine ESTs
  JOURNAL   Unpublished (2000)
COMMENT     Contact: Lewin, H. A.
            W. M. Keck Center for Comparative and Functional Genomics
            University of Illinois at Urbana-Champaign
            340 Edward R. Madigan Laboratory, 1201 W. Gregory Dr., Urbana, IL
            61801, USA
            Tel: 217 333 5998
            Fax: 217 244 5617
            Email: h-lewin@uiuc.edu
            Funding for cattle EST sequencing was provided by the USDA National
            Research Initiative, Animal Genome Resource Grant AG 99-3205-8534
            to H. A. Lewin and J. E. Womack. Base Calling/Quality Scores: PHRED
            from Washington University Genome Center. Vector Trimming:
            Cross_match from Washington University Genome Center PHRAP suite.
            This sequence is vector free and at least 200 bp in length.
            PCR PRimers
            FORWARD: TAATACGACTCACTATAGGG
            BACKWARD: ATTAACCCTCACTAAAG
            Insert Length: 631 Std Error: 0.00
            Plate: BP250021B10 row: D column: 3
            Seq primer: AGCGGATAACAATTTCACACAGGA
            High quality sequence stop: 631.
FEATURES             Location/Qualifiers
     source          1..631
                     /organism="Bos taurus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9913"
                     /clone="BP250021B10D3"
                     /sex="female"
                     /lab_host="DH10B"
                     /clone_lib="Soares normalized bovine placenta"
                     /note="Organ: placenta; Vector: pT7T3Pac; Site_1: EcoRI;
                     Site_2: NotI; The cDNA library was contributed by the
                     Soares laboratory and it was constructed and normalized
                     as described by Bonaldo, M.F., Lennon, G. and Soares,
                     M.B. (1996), Genome Research 6(9): 791-806."
ORIGIN

Query Match 16.1%; Score 617; DB 10; Length 631; 

Best Local Similarity 99.7%; Pred. No. 4e-102; 

Matches 62 9; Conservative 0; Mismatches 0; Indels 2; Gaps 1; 

Qy 2834 AG CAAAAAT C C C T GGATT GAAGC GTAAAG CT GAAT GAGAAAGC CT GAAAGAGT T AACAAT 2893 

M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I ! I I I I I I I II I I I I I I I M II I I I I I I I 
Db 1 AGCAAAAAT C C CT G GATT GAAG C GTAAAGCT GAAT GAGAAAGC C T GAAAGAGT T AACAAT 60 

Qy 28 94 AGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGA 2953 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 61 AGAGGAGT T TAT C T T T AAAGGGGAT AT T CAT T T GAT T C CAT T GG G GAGGGT C AGG GAAGA 12 0 

Qy 2954 ACAAAGC CT T GACAT T GCAGT G C AGT T T C AC AGAT CT T TAT T T T TAG CAAC G C AGT GT C T 3013 

I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I 
Db 121 ACAAAGC CT T GACAT T GCAGT GCAGT T T CAC AGAT CT T TAT T T T T AGCAAC GCAGT GT CT 18 0 

Qy 3014 GAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCT 3073 

I I I I I I I I I I II I I I I I I I I II I I II I 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 181 GAG GAAAAAT GAC CT GT CT T GACT GCCCTGTGTT CAT CAT CT T AAGT AT T GT AAG C T G CT 24 0 

Qy 3074 AT GT AT G GAT T TAAAT C GT AAT CAT AT TTGTTTTTCCT GT AT G AG GC AC T G GT GAAT AAA 3133 

I II I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 241 AT GT AT G GAT T TAAAT C GT AAT CAT AT TTGTTTTTCCT GT AT GAGGC AC T GGT GAAT AAA 300 

Qy 3134 CAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGG 3193 

I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGG 360 

Qy 3194 AAT T GCAAAGAAAGT G GAGCT GAC AGAAAT AAC C C T T T T CAC AGT T T GT GC AC T GT GT AC 32 53 

I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 AAT T GCAAAGAAAGT GGAGC T GACAGAAAT AAC C C T T T T CACAGT T T GT GC ACT GT GT AC 42 0 

Qy 3254 G GT C T GT GT AGGT T GAT G C AGAT T T T CT GAAAT GAAAT GT T T AGAC GAGAT CAT G C CAC C 3313 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 421 G GT C T GT GT AG GTT GAT GC AGATT T T CT GAAAT GAAAT GT T T AGAC GAGAT CAT GC CAC C 480 



Qy   3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 540

Qy   3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGAT--TATATATATCTATATATAGTGTT 3431
          |||||||||||||||||||||||||||||||||||  |||||||||||||||||||||||
Db    541 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATATCTATATATAGTGTT 600

Qy   3432 TCACGAAGCTTAGCCCTTTACCTTCCCAGCT 3462
          |||||||||||||||||||||||||||||||
Db    601 TCACGAAGCTTAGCCCTTTACCTTCCCAGCT 631
CB521332

LOCUS       CB521332 822 bp mRNA linear EST 09-JUL-2003
DEFINITION  UI-M-GH0-cem-h-13-0-UI.r1 NIH_BMAP_GH0 Mus musculus cDNA clone
            IMAGE:6841502 5', mRNA sequence.
ACCESSION   CB521332
VERSION     CB521332.1 GI:29354687
KEYWORDS    EST.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 822)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Jim Lin, University of Iowa
            cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
            cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
            DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
            Clone Distribution: Distribution information can be found at
            http://genome.uiowa.edu/distribution/mousef1.html
            This clone was contributed by the Brain Molecular Anatomy Project
            (BMAP)
            Seq primer: pYX-5.
FEATURES             Location/Qualifiers
     source          1..822
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6"
                     /db_xref="taxon:10090"
                     /clone="IMAGE:6841502"
                     /tissue_type="Whole brain"
                     /dev_stage="1, 5, and 15 days newborn"
                     /lab_host="DH10B (T1 phage resistant)"
                     /clone_lib="NIH_BMAP_GH0"
                     /note="Organ: Brain; Vector: pYX-Asc; Site_1: EcoR I;
                     Site_2: Not I; The library was constructed according
                     Bonaldo, Lennon and Soares, Genome Research, 6:791-806,
                     1996. Denatured RNA was size fractionated on a 1% agarose
                     gel. First strand cDNA synthesis was primed with oligo-dT
                     primer containing a Not I site. Double strand cDNA was
                     size selected according to mRNA size fraction, ligated
                     with EcoR I adaptor, digested with NotI and then cloned
                     directionally into pYX-Asc vector. The library tag
                     sequence located between the Not I site and the polyA tail
                     is CGAACTGAAT. This library was created for the University
                     Iowa Brain Anatomy Project (BMAP): 'Gene Discovery in the
                     Developing Mouse Nervous System', supported by National
                     Institute of Mental Health (NIMH), Hemin Chin, Ph.D.,
                     program coordinator."
ORIGIN



Query Match 16.0%; Score 614.2; DB 14; Length 822; 

Best Local Similarity 85.4%; Pred. No. 1.2e-101; 

Matches 713; Conservative 0; Mismatches 108; Indels 14; Gaps 2; 
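
The "Best Local Similarity" figure reported above appears to be the share of aligned columns that are identical. The sketch below assumes that definition (matches divided by matches plus mismatches plus indels); the report itself does not state the formula, and the function name is illustrative only.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are identical (assumed definition)."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the "Matches ... Indels ..." line of this result.
print(round(best_local_similarity(713, 108, 14), 1))
```

Under this assumption the counts for this result round to the 85.4% printed in the report.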

Qy 1885 CT C C GAT T GAGAT T AT AGAT GAGT T C C C GAC CT T T GT CAGT T CT AAAGC AGAT T C T T CT C 1944 

I I I I I I I I I I I I I I I I i I I I II II I I I I I I I II I I I I I I MINI II 
Db 1 CT CC CATT GAGAT AATAGATGAGTTT C C CACAT TT GT CAGT GCTAAAGAT GAT T CT CCT - 59 

Qy 1945 CTACATTAGCCAGGGAATACACTGACCTAGAAGTAGCCCACAAAAGTGAAATTGCTGACA 2004 

I I II I I I I I II I I I I I I I I I I I II I I II I I I I II I I I I I I I I 
Db 60 AAGGAGTACACTGACCTAGAAGTATCCAACAAAAGTGAAATTGCTAATG 108 

Qy 2005 TCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGATTGCCCCATGACCTTTCTTTCAAGA 2 064 

Mill I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II II I I 

Db 109 TCCAGAGCGGGGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGA 168 

Qy 2065 GT AT ACAAC CT AAAGAGGAAGT T CAT GT C CCAGAT GAGT T C T C CAAAGAT AG G GGT GAT G 2124 

I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I t II I I II 
Db 169 AT AC AT AT C CT AAAGAT GAAGC ACAT GT CT CAGAT GAAT T CT C CAAAAGT AG GT C CAGT G 228 

Qy 2125 TTTCAAAGGTGCCCGTACTGCCT CCAGAT GTTTCTGCTTTGGATGCTCAAGCAGAGAT AG 2184 

I II I I I I I I I II II III I I I I I I M I I I I I I I I I I I I I I I I I I II II I 
Db 229 TATCTAAGGTGCCCTTATTGCTTCCAAATGTTTCTGCTTTGGAATCTCAAATAGAAATGG 288 

Qy 2185 GCAG CAT AGAAAAAC C CAAAGT T CT T GT GAAAGAAGC C GAGAGAAAAC TTC C TT C T GAT A 2244 

III II I I I I I I I I I I I I II III I I I I I I I I I III I I I I I I I I I I I I I I I I I 
Db 28 9 GCAACAT AGT T AAAC C CAAAGT ACT T AC GAAAGAAGC AGAG GAAAAACT T C CT T CT GAT A 348 

Qy 2245 C AGAAAAAGAGC GAAGAT CT C CAT CT GCT ATAT T T T C AGC AGAGC T GAGT AAAAC T T C AG 2 304 

I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 34 9 CAGAGAAAGAGGACAGATCCCTGACAGCTGTATTGTCAGCAGAGCTGAATAAAACTTCAG 4 08 

Qy 2305 TTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCT 2364 

I I I I I I I I I I II I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 4 09 TTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCT 4 68 

Qy 2365 TGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCT 2424 

I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I II I 
Db 469 TATTCCTGCTGCTGTCTCTGACAGTGTT--TCATTGTCAGTGTAACGGCCTACATTGCCT 526

Qy 2425 TGGCCCTGCTCTCTGT GAC TAT C AG CT T T AGGAT AT AT AAGGGT GT GAT C C AG GC T AT C C 24 84 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 527 TGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAAGCTATCC 58 6 



Qy 2485 AGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTG 2544



I I I I I I I I I M I I I I I I I II I I II I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I 

Db 587 AGAAAT CAGAT GAAGGC C ACCCAT T CAGGGCATAT T T GGAAT CT GAAGT T GC CAT AT CAG 646 

Qy 2545 AGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAG 2604 

I I I I I I I I I I I I I I I II II I I I I I I I I I I II I I I I I I I I III I I I I I I I I I I I I 

Db 647 AG GAAT T G GT T C AGAAAT AT AG TAAT TCTGCTCTTGGT CAT GT GAAC AGC AC AAT AAAAG 7 06 

Qy 2605 AACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGA 2664 

II I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 7 07 AATTGAGGCGTCTCTTCTTAGTTGATGACTTAGTTGATTCCCTGAAGTTTGCAGTGTTGA 7 66 

Qy 2665 TGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2719 

I I I I I I I I I I I I I II I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I III 
Db 7 67 TGTGGGTATTTACTTACGTTGGTGCCTTGTTCAATGGTTTGACACTACTGACTTT 821 



Search completed: September 11, 2004, 15:09:51 
Job time : 9102.67 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         September 11, 2004, 00:39:35 ; Search time 14422.6 Seconds
                                               (without alignments)
                                               11519.006 Million cell updates/sec

Title:          US-09-830-972-28
Perfect score:  3833
Sequence:       1 ctatctcctctctcagccgc ttaaaaaaaaaaaaaaaaaa 3833



Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0
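
The "sw model" with Gapop and Gapext refers to Smith-Waterman local alignment with affine gap penalties. The following is a minimal sketch of that technique, not the GenCore implementation. The match score of 1 and mismatch score of -0.6 are assumptions chosen because they are consistent with the scores printed in this report (the IDENTITY_NUC table itself is not shown), and the convention that a gap of k positions costs Gapop + k * Gapext is likewise assumed.

```python
def smith_waterman(q, d, match=1.0, mismatch=-0.6, gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (Gotoh recurrences).

    Assumed convention: a gap of k positions costs gap_open + k * gap_ext.
    """
    neg = float("-inf")
    prev_h = [0.0] * (len(d) + 1)   # best score ending at (i-1, j)
    prev_f = [neg] * (len(d) + 1)   # best score ending in a vertical gap
    best = 0.0
    for i in range(1, len(q) + 1):
        h = [0.0] * (len(d) + 1)
        f = [neg] * (len(d) + 1)
        e = neg                     # best score ending in a horizontal gap
        for j in range(1, len(d) + 1):
            # Extend an existing gap or open a new one.
            e = max(e - gap_ext, h[j - 1] - gap_open - gap_ext)
            f[j] = max(prev_f[j] - gap_ext, prev_h[j] - gap_open - gap_ext)
            s = match if q[i - 1] == d[j - 1] else mismatch
            # Local alignment: never let a cell drop below zero.
            h[j] = max(0.0, prev_h[j - 1] + s, e, f[j])
            best = max(best, h[j])
        prev_h, prev_f = h, f
    return best


print(smith_waterman("ACGTACGT", "ACGAACGT"))
```

The sketch keeps only two rows of the matrix, so it reports the score but not the aligned columns; recovering the alignment itself would need a traceback over the full matrices.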



Searched:       3470272 seqs, 21671516995 residues

Total number of hits satisfying chosen parameters:      6940544

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
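
The throughput figure in the run header can be cross-checked from the other numbers in it. The sketch below assumes that one "cell update" is one query-residue by database-residue comparison and that both strands were searched (the hit count is exactly twice the number of database sequences); neither point is stated in the report, and the function name is illustrative.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=2):
    """Dynamic-programming cells filled per second, in millions (assumed model)."""
    cells = strands * query_len * db_residues
    return cells / seconds / 1e6


# Query length, database size and search time as printed in the run header.
rate = million_cell_updates_per_sec(3833, 21671516995, 14422.6)
print(round(rate, 1))
```

With these assumptions the result agrees with the reported 11519.006 to within the rounding of the printed search time.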



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10: gb_ro:*
                11: gb_sts:*
                12: gb_sy:*
                13: gb_un:*
                14: gb_vi:*
                15: em_ba:*
                16: em_fun:*
                17: em_hum:*
                18: em_in:*
                19: em_mu:*
                20: em_om:*
                21: em_or:*
                22: em_ov:*
                23: em_pat:*
                24: em_ph:*
                25: em_pl:*
                26: em_ro:*
                27: em_sts:*
                28: em_un:*
                29: em_vi:*
                30: em_htg_hum:*
                31: em_htg_inv:*
                32: em_htg_other:*
                33: em_htg_mus:*
                34: em_htg_pln:*
                35: em_htg_rod:*
                36: em_htg_mam:*
                37: em_htg_vrt:*
                38: em_sy:*
                39: em_htgo_hum:*
                40: em_htgo_mus:*
                41: em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES
                 %
Result         Query
        Score  Match  Length  DB  ID        Description
     1   2716   70.9    4060   9  AY123250  AY123250 Homo sapi
     2   2716   70.9    4070   9  AY123249  AY123249 Homo sapi
     3   2716   70.9    4102   9  AY123245  AY123245 Homo sapi
     4   2716   70.9    4109   9  AY123248  AY123248 Homo sapi
     5   2716   70.9
ACCESSION   AY123250
VERSION     AY123250.1  GI:26800589
FEATURES             Location/Qualifiers
     source          1..4060
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="2"
                     /map="2p16"
                     /tissue_type="testis"
     gene            1..4060
                     /gene="RTN4"
                     /note="synonym: NOGO"
     5'UTR           1..133
                     /gene="RTN4"
     CDS             134..3094
                     /gene="RTN4"
                     /note="RTN4-G; alternatively spliced"
                     /codon_start=1
                     /product="RTN4 isoform G"
                     /protein_id="AAM64254.1"
                     /db_xref="GI:26800590"
                     /translation="MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHE
                     YLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSP
                     KAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVVSSEKAK
                     DSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKC
                     FADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPL
                     LGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE
                     EVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP
                     SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENP
                     PPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPD
                     FSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLT
                     ETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPL
                     QMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE
                     YTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATS
                     KVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTS
                     VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ
                     AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLK
                     FAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMA
                     KIQAKIPGLKRKAE"
     3'UTR           3095..4060
                     /gene="RTN4"
ORIGIN

Query Match 70.9%; Score 2716; DB 9; Length 4060; 

Best Local Similarity 86.0%; Pred. No. 0; 

Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25; 
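
The Score and Query Match values above can be reproduced from the alignment counts. The sketch below is a consistency check under stated assumptions, not the program's documented scoring: it takes +1 per match, -0.6 per mismatch, and a gap cost of Gapop per gap plus Gapext per indel position. The -0.6 mismatch value is inferred from the numbers in this report; the IDENTITY_NUC table is not printed here. Query Match is taken to be the score as a percentage of the perfect score (3833).

```python
def alignment_score(matches, mismatches, indels, gaps,
                    match=1.0, mismatch=-0.6, gap_open=10.0, gap_ext=1.0):
    """Score implied by the alignment counts (assumed scoring values)."""
    gap_cost = gap_open * gaps + gap_ext * indels
    return match * matches + mismatch * mismatches - gap_cost


def query_match_percent(score, perfect_score):
    """Score as a percentage of the query's self-match score."""
    return 100.0 * score / perfect_score


# Counts copied from the result above; 3833 is the perfect score of the query.
score = alignment_score(matches=3332, mismatches=445, indels=99, gaps=25)
print(round(score, 1), round(query_match_percent(score, 3833), 1))
```

The same assumed scoring also reproduces the 614.2 reported for the 822 bp mouse EST hit (713 matches, 108 mismatches, 14 indels, 2 gaps), which is why it was chosen for this sketch.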

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

M I I I I I M I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I II II 

Db 227 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 286 

Qy 61 GT AC T GC C C AC T GAAG GAACAC T T C C AGCAACT T CAAAT GAAGC T T C T AAAG CAT T CT C A 120 

Ml I M I I I I I I M I II I I I I I I II II I I II I I I I I I I I I I I I I I I I I 
Db 2 87 GT AT T AC C C ACT GAAG GAACAC T T CAAGAAAAT GT C AGT GAAGCT T CTAAAGAGGT CT C A 34 6 

Qy 121 GAGAAG GC AAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AG AAT T T T C AGAAT T GGAAT AT 18 0 

I I I I I I I I I I M I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I Mill 
Db 347 GAGAAG GCAAAAACT C TACT CAT AGATAGAGAT T T AAC AGAGT T T T CAGAAT T AGAAT AC 4 06 

Qy 181 T CAGAAAT GGAAT CAT CAT T C AGT G G CT CT CAAAAG G C AGAAC C T GC C GTAAC AGTAG C G 2 40 

I I I I M I I II I I I I I I I I I I I I I I I I I I III II I I I I I I II II I I I MINI 
Db 407 T CAGAAAT G GGAT CAT C GT T C AGT GT CT C T C CAAAAGC AGAAT CT G C C GT AAT AGT AGC A 4 66 

Qy 241 AAT C CT AGGGAC GAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAG G ACT T AGT TAGT C T T 3 00 

I I I I I I I I I I I I I I I I I I I I I I I || I | | | | | | | | | | | | | | | | | | || I I 
Db 4 67 AAT C C T AGG GAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT TAGT TAGT AAT 526 



Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360



Db 527 AAC AT C CT T CAT AAT CAACAAGAGT TAC CT AC AG C T CT T AC TAAAT T G GT TAAAGAGGAT 586 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420

1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I | | I I I I I I I I | | 
Db 587 GA AGTT GT GT C T T C AGAAAAAG C AAAAGACAGT T T T AAT GAAAAGAG AGT T GCAGT G 643 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480

I I I I II Mill I I I I I I I II I I I I I I I I I I I I I I I I I | I | | | | | | | | | | | | | | | | | | | 

Db 64 4 GAAGCT C CT AT GAGG GAG GAAT AT GCAGAC T T CAAAC CAT T T GAG C GAGT AT GGGAAGT G 703 

Qy 4 81 AAAGAT ACT T ACAAG C AAGAT AGT GAT GT T T T GAT T G C T GGAG GT AATAT AGAGAGCAAA 540 

I I M I II I Ml I II II I II I || | | M I I I I I I I I I M I || | || | | | | | 
Db 7 04 AAAGAT AGT AAG GAAGAT AGT GAT AT GTTGGCTGC T GGAGGT AAAAT C G AGAG CAAC 7 60 

Qy 541 TT GGAAGGTAAAGT GGATAAGAAACACT TTT CAGAT AGCCTT GAACAAACAAAT C GTGAA 600 

I I I I I I M I I I I I I II I I I I I I III I I I I I I I I I I I I I I I I I I I I I I Ml 
Db 761 T T GGAAAGTAAAGT GGAT AAAAAAT GT T T T GC AGAT AGC C T T GAGCAAACT AAT C AC GAA 820 

Qy 601 AAAGAT AGT GAAAGC AGT AAT GAT G AC ACT T CAT T T C C C AGT ACAC C AGAAGCT GT AAGA 660 

I I I I I I I I I I I M I I I I I I , I I I I I I I I I I | | M | | | | | | | | | | | | | | | | 
Db 821 AAAGAT AGT GAGAGT AGTAAT GAT GAT AC TTCTTTCCC C AGT AC G C C AGAAGGT AT AAAG 880 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAAC T GAGAAT GT T 717 

I I I I M I I I I I II Mill II II II II M I M M I II I I II II I II II 

Db 8 81 GAT C GT T CAGGAGC AT AT AT CACAT GTGCTCCCTT TAAC C C AGC AGCAAC T GAGAG CAT T 94 0 

Qy 718 T CAACAAACAT TTT T C C CTT GT T GGAAGAT C AT ACTT C GGAAAAT AAGAC AGAT GAAAAA 777 

M I I I I II I II II I II I I I I I I II I I I II II II I I I II II I I I I I I I I II I I I 
Db 941 G CAACAAACAT TTTTCCTTT GT T AG GAGAT C CT ACTT CAGAAAAT AAGAC C GAT GAAAAA 1000 

Qy 77 8 AAGAT AGAA-AAAAAAAG GCACAAAT T GT AAC AGAGAAGAAT G C AAGT GT C AAGACAT C A 836 

II I I I I M MM Mill Mill II M I I II I I II I II I || || I || M II 

Db 1001 AAAAT AGAAGAAAAGAAGGC C CAAAT AGT AAC AGAGAAGAAT ACT AG C AC C AAAAC AT C A 10 60 

Qy 837 AAC C C T T T C CT T AT GGC AGC ACAGGAGT CT AAGACAGAT TAC GT T ACAAC AGAT CAT GT G 89 6 

I I I I I I M III I II II II II || I M | II II I M I II II I II II M I I II I 
Db 1061 AAC CCTTTTCTT GT AG C AGCAC AGGAT T CT GAGACAGAT TAT GT C ACAAC AGAT AAT T T A 1120 

Qy 8 97 T CAAAGGT GAC C GAGGAAGTAGT GGCAAACAT G C CT GAAGGT C TAAC C C C AGATT T GGT T 956 

M I II I II I I II II II II II II II II I I II II II II II II II II I II II I II 
Db 1121 ACAAAGGT GACT GAGGAAGT C GT GGCAAACAT GC CT GAAG GC C T GAC T C C AGATT T AGT A 1180 

Qy 957 CAG GAAGC AT GT GAAAGT GAAT T GAAT GAAGCT ACT GGT ACAAAAAT T G C C T T T GAAAC A 1016 

M I I I M I I M II II II I I I I I II II II II I II II II M I II I I II II I II II I II 
Db 1181 C AGGAAGCAT GT GAAAGT GAAT T GAAT GAAGT TAC T G GT ACAAAGAT T GCT T AT GAAAC A 124 0 

Qy 1017 AAAAT G GAC C T G GT T CAAACT T C AGAAG C T GT GCAG GAGT C ACT T TAC C C T GT AACAC AG 1076 

> I I I I M II II II II II II II II II I I II II I II I I II I II II II I II I II 
Db 1241 AAAAT GGACT T GGT T CAAAC AT C AGAAGT T AT GC AAGAGT C AC T CTAT C C T G CAG CAC AG 1300 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

M II II II I II I II II I II II II I II II I II I II II I II II II M II II II I I I 
Db 1301 CT T T G C C CAT CAT T T GAAGAGT C AGAAGC T ACT C CTT CAC C AGT T T T G C CT GACAT T GTT 1360 

Qy 1137 AT GGAAGC AC CAT TAAAT T CT GT AGT T C CTAGT GCTGGTGCTTCTG C AGT GCAG C T CAGT 1196 

M I II II II I M II II II II I I II I II II II II II I II II II I I MM Ml 



Db 13 61 AT GGAAG C AC CAT T GAAT T C T GC AGT T C CT AGT GCTGGTGCTTCCGT GAT AC AG C C C AG C 142 0 

Qy 1197 T CAT C AC CAT T AGAAAC TCTTCCTT C AGTTAAT TAT GAAAGC AT AAAGT T T G AGCCT GAA 1256 

I I I I I I I I I I I I I I I I I I I II II II I I I I I I I I ! I I I I I I I I I I I I I I I I I I 

Db 1421 T CAT C AC CAT T AGAAG C T TCTTCAGTTAATTAT GAAAGC AT AAAACATGAGCCT GAA 1477 

Qy 1257 AAT C C C C CAC CAT AT GAG GAGG C CAT GAAT GT AT C AC T AAAAAAAGAAT C AG GAAT GAAT 1316 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 147 8 AAC C C C C CAC CAT AT G AAGAGG C CAT GAGT GT AT CAC T AAAAAAAGT AT C AG GAAT AAAG 1537 

Qy 1317 GAAGAAATCACAGAGCCT GAAGGT AT T AGT GTAGCT GTT CAGGAAACAGAAGCT CCT TAT 137 6 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I II I I I I I I I II I I I I I 
Db 1538 GAAGAAATTAAAGAGC C T GAAAAT AT TAAT GCAGC T C T T CAAGAAAC AGAAGCT C C T TAT 1597 

Qy 1377 AT AT CT ATT GCAT GT GATTT AAT TAAAGAAACAAAGAT CT CTACT GAACC GACT CCAGAT 1436 

I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I III I I I I I I I I I I I III 
Db 1598 AT AT C T ATT GCAT GT GAT T TAAT TAAAGAAACAAAGC T TT C T G CT GAAC C AG C T C C G GAT 1657 

Qy 14 37 T T C T CT AGT TAT T C AGAAAT AGC AGAAGT T G CAC AG C C AGT GC C C GAGC AT T CT GAG C T A 1496 

I I I I I I I I I I II I I I I I I III I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I 
Db 1658 T T CT C T GAT TAT T C AGAAAT GG CAAAAGT T GAAC AGC C AGT G C CT GAT CAT T CT GAGCT A 1717 

Qy 14 97 GT T GAAGAT T C CT C C C C CGAT T CT GAAC CAGT T GAC T TAT T T AGT GAT GAT T C AAT AC C C 1556 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 1718 GTT GAAGAT T C CT CAC CT GAT T CT GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC CT 1777 

Qy 1557 GAAGT T C C AC AAAAAC AAGAT GAAG CT GT AAT AC T T GT G AAAGAAAAC C T CAC T GAAAT T 1616 

II I I I I I I I I I I II I I I I I M I I I I I I II I I I I II I I I I I M I I I II I I I I I 

Db 1778 GACGTT CCACAAAAACAAGAT GAAACT GT GAT GCTT GT GAAAGAAAGT CT CACT GAGACT 1837 

Qy 1617 T CAT CT GAGT CAAT GAC AG GAC AT GACAAT AAGG GAAAAC T CAGT GCTT CAC CAT CAC CT 1676 

I I I I I I I I I I I I I I I II I MM II II I I I I I I I II II I II I I I I I II I I 
Db 1838 T CATTT GAGT CAAT GAT AGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 1894 

Qy 1677 GAGGGAGGAAAAC C GT ATT T G GAGT CT T T T C AGC C CAGT TT AGGC AT C ACAAAAGAT AC C 1736 

I M II II I I I I I I I II II II I MUM III I I II II II I I I II M II I I I I I 
Db 1895 GAGGGAG GAAAG C CAT ATT T G GAAT C T T T T AAG CT CAGT T TAGAT AAC ACAAAAGAT AC C 1954 

Qy 1737 T T AGCAC CT GAT GAAGT T T CAG CAT T GAC C CAAAAG GAGAAAAT C C CT TT GC AGAT GGAG 17 96 

I M I I M II I I I II I I I MUM I II I I I II I I II II I I I I I M II I I I I I I 
Db 1955 CT GT T AC C T GAT GAAGT T T C AACAT T GAGCAAAAAGGAGAAAAT T C C T T T G C AGAT GGAG 2014 

Qy 1797 GAGCT CAAT ACT GC AGT T TAT T CAAGT GAT GG CT T AT T CAT T GCT CAG GAAG CAAAC CT A 1856 

I II II II I I I I I I I I II I I II II I Mill II II I I Ml II I I II I I II I II 
Db 2015 GAG CT CAGT AC T G C AGT T TAT T CAAAT GAT GACT TAT T TAT T T CTAAGGAAGC AC AGAT A 2074 

Qy 1857 AGAGAAAGT GAAAC AT T T T C AGAT T CAT CT CC GAT T GAGAT TAT AGAT GAGT T C C C GAC C 1916 

I I II I II II II I I I II I II II II II II M I I II M II II I I I II I I II I M I II 

Db 2 075 AGAGAAAC T GAAAC GT T T T C AGAT T CAT C T C CAAT T GAAAT TAT AGAT GAGT T C C C T AC A 2134 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II M II II I I II I I I II I I I I III I I I I II I II II I I I I I I II I I I II I II 

Db 2135 T T GAT CAGT T C T AAAACT GAT T CAT T T T CT AAAT T AGC CAG G GAAT AT ACT GAC C T AGAA 2194 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

M I M I I II II I I I I I I II I I II I M I I I I II I II II I I II II II I II I II 
Db 2195 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2254 



Qy 2 037 GGAT T G C CC C AT GAC CTTTCTTT CAAGAGT AT ACAAC C T AAAGAGG AAGT T CAT 2 090 

I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2255 GAAT T G C C C CAT GAC CT T T C T T T GAAGAAC AT ACAAC C C AAAGTT GAAGAGAAAAT C AGT 2314 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 215 0 

I I I I I I II I II I I I I I I I I I III I I I II I I I I I I I I I I I I I I I I 
Db 2315 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2374 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG C AGAGAT AG GCAGCAT AGAAAAAC C CAAAGT T C T T 2210 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2375 GAT GTTTCTGCTTTGGC CAC T CAAGC AGAGAT AGAGAG CAT AGT T AAAC C C AAAGT T C T T 2434 

Qy 2211 GT GAAAGAAGCCGAGAGAAAACTT C CT T CT GATACAGAAAAAGAGCGAAGAT CT CCAT CT 2270 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I | I I I 
Db 2435 GT GAAAGAAGC T GAGAAAAAACT T C CT T C C GAT AC AGAAAAAGAGGAC AGAT CAC CAT C T 24 94 

Qy 2271 GCT AT AT T T T CAG CAGAG CT GAGTAAAACT T CAGT T GT T GAC CT C CT C T AC T GGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2495 GC TAT AT T T T CAGCAGAGC T GAGT AAAAC T T CAGT T GT T GAC CT C CT GT AC T GGAGAGAC 2554 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I II I I II I I II I I M I I I I I I I I I I I I I I I I I I II I I I I I II I 
Db 2555 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 2 614 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I || I I I I M I 
Db 2615 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 2674 

Qy 2 451 TTTAGGATATATAAGGGT GT GAT C CAGGCT AT CCAGAAAT CT GAT GAAGGC CACCCAT T C 2510 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I 
Db 2675 T T T AGGAT AT ACAAGGGT GT GAT C CAAGCT AT C C AGAAAT C AGAT GAAGG C CAC C CAT T C 2734 

Qy 2511 AGGGC ATATTT GGAAT CT GAAGTT GCT AT AT CT GAGGAGTT GGTT CAGAAGTACAGCAAT 2570 

I I I M I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I III 
Db 2735 AGGG C AT AT CT GGAAT CT GAAGT T GCT AT AT CT GAG GAGT T GGT T C AGAAGT AC AGT AAT 2794 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 2795 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2854 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 2855 GATTTAGTTGATTCTCT GAAGTT TGCAGTGTT GAT GTGGGTATTTACCTATGTTGGTGCC 2914 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I M I I II I I I I II I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I II 
Db 2915 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 2 974 

Qy 2751 AT T TAT GAAC G GC AT C AGGC GC AAAT AGAT CAT TAT CT GG GACT T GC AAAT AAGAAT GT T 2810 

I I I I I M M II I I I II I I I I II I I I I II I I I I I M I I I I I I I I I I I I I I M I I I I I I 
Db 2975 ATTTAT GAAC GGCAT CAGGCACAGATAGATC ATTAT CTAGGACTT GCAAATAAGAATGTT 3034 

Qy 2811 AAAGAT G CT AT GG C T AAAAT C CAAG CAAAAAT C C C T G GAT T GAAG C GT AAAG C T GAAT G A 287 0 

I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 3035 AAAGAT GCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCT GAAT GA 30 94 



Qy 2 871 GAAAGC CT GAAAGAGT TAACAAT AGAG GAGT T T AT CT T T AAAG GG GAT AT T CAT T T GAT T 2 93 0 

II III I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I 

Db 3095 AAAC G C C C AAAAT AAT T A GT AG GAGT T CAT C T T T AAAG G G GAT AT T CAT T T GAT T 314 9 

Qy 2931 C CAT T G GG GAG G GT C AGGGAAGAAC AAAG C C T T GAC AT T G C AGT G C AGT T T C AC 2984 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3150 AT AC G G GG GAG G GT C AG G GAAGAAC GAA- C C T T GAC GT T G C AGT G CAGT T T C AC AGAT C G 3208 

Qy 2 985 AGAT CT T TAT T T T TAG CAAC GC AGT G- T C T GAGGAAAAAT GAC CT GT CT T GACT G 3038 

I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3209 TTGTTAGATCTTTATTTTTAGC CAT GCACTGTTGTGAGGAAAAATTACCTGTCTT GACT G 3268 

Qy 3039 CC CT GT GTT CAT CAT CT TAAGT ATT GTAAGCT GCT ATGT AT GGATTTAAAT C GTAAT CAT 3098 

II I I I II II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 32 69 CC AT GT GT T CAT CAT CT TAAGT AT T GTAAGCT GC T AT GT AT G GAT T T AAAC C GTAAT CAT 332 8 

Qy 3099 AT TT GT TT T T C C T GT AT GAG GC AC T GGT GAAT AAACAAAGAT C T GAGAAAGC T GT AT AT T 315 8 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3329 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC C T GT AT AT T 3375 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I III I I I I I I I I 
Db 337 6 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGA 34 35 

Qy 3215 GAC AGAAATAAC C CT T T T C AC AGT TT GT G CAC T GT GT AC G GT CT GT GT AGGTT GAT G 3271 

I I III II I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 3436 AAAAAAAAAAAAAAAGCCCTTTT CAGT TTGTGCACTGTGTAT GGT CCGTGTAGATT GAT G 3495 

Qy 3272 C AG AT T T T C T GAAAT GAAA T GT T TAG AC GAGAT CAT G C CAC C AAG GC AGGAGT GAA 3327 

II I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I II II I I I I I I II I 

Db 34 96 CAGAT T T T C T GAAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C GGTAAAGC AGGAAT GAC 3555 

Qy 3328 AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG 33 8 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 3556 AAAG CT T G- CT TT T CT G GT AT GT T CT AGGT GT AT T GT GAC T T T TACT GT TAT AT TAAT T G 3614 

Qy 3388 C CAAT AT AAGTAAAT AT AGATT AT AT AT AT CT AT AT AT AGT GT T T CAC GAAG CT T AGC C C 3447 

I I I I I I I I I II I I I I I I I I I I I I I I III I II I I I I I II I M I I I I I II I II 

Db 3615 C CAAT AT AAGTAAAT AT AGATT AT A TAT GT AT AGT GT T T CAC AAAGCT T AGAC C 3668 

Qy 344 8 TTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGT 3501 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 3669 TT T AC CT T - C C AGC CAC C C CAC AGT GC T T GAT AT T T C AGAGT CAGT CAT T G GT T AT AC AT 3727 

Qy 3502 GTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGT 3561 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I II I I I I II 

Db 372 8 GT GTAGTT C CAAAGCACATAAGCTAGAAGAAGAAATATTT CTAGGAGCACTACCAT CT GT 3787 

Qy 3562 T T T CAAC AC GAAC C GAC GC CAT G C AAAC AGAACT C C - T CAAC AT AAAC T T CAC T GCAC AG 3620 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 3788 T T T CAAC AT GAA AT GC CACACACATAGAACT C CAACAACAT CAATTTCATT GCAC AG 3844 

Qy 3621 ACTTACTGTAGTTAATTTTATCAC--AAACTCTGGACTGAATCTAATGCTTCCAAAAA-- 3676

III I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3845 ACT GACT GTAGTTAAT TTT GTC ACAGAAT CT AT GGACT GAAT CTAAT GCTT CCAAAAAT G 3904 

Qy 3677 T GT T T G C AAAT AT C AAAC AT T GT T AT GT AAGAAAAT AT AAATGACGA 3723 



Db 3905 TTGTTTGTTT GCAAAT AT CAAAC AT T GT TAT G CAAGAAAT TAT T AAT T ACAAAAT GAAGA 3964 

Qy 3724 T T TAT AC AAT T GT G GT T T AAGCT GT AT T GAAC TAAAT CT GT G GAAT GC AT T GT GAACT GT 37 83 

I I II I I I I I I I I I I II II I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I II I I 
Db 3965 TTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGT 4024

Qy 37 84 AAAAG CAAAGT AT CAAT AAAG C T TATAGACT T AAAA 3819 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 402 5 AAAAG CAAAGT AT CAAT AAAGCT T AT AGAC T TAAAA 4 060 
FEATURES             Location/Qualifiers
     source          1..4070
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="2"
                     /map="2p16"
                     /tissue_type="testis"
     gene            1..4070
                     /gene="RTN4"
                     /note="synonym: NOGO"
     5'UTR           1..143
                     /gene="RTN4"
     CDS             144..3104
                     /gene="RTN4"
                     /note="RTN4-F; alternatively spliced"
                     /codon_start=1
                     /product="RTN4 isoform F"
                     /protein_id="AAM64253.1"
                     /db_xref="GI:26800587"
                     /translation="MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHE
                     YLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSP
                     KAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVVSSEKAK
                     DSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKC
                     FADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPL
                     LGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE
                     EVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP
                     SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENP
                     PPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPD
                     FSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLT
                     ETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPL
                     QMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE
                     YTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATS
                     KVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTS
                     VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ
                     AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLK
                     FAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMA
                     KIQAKIPGLKRKAE"
     3'UTR           3105..4070
                     /gene="RTN4"
ORIGIN

Query Match 70.9%; Score 2716; DB 9; Length 4070; 

Best Local Similarity 86.0%; Pred. No. 0; 

Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I II II 
Db 237 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 296 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120

IN I I I M I I I I I I I I I I I I I I I II I I I I I II I I I I I II I I I I I I I I I 
Db 297 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 356 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180

M I I I I I I I I I I I II I I I I I I Ml I I I I I I I I I I II I I I I I I I I I '-'Ml 
Db 357 G AG AAG G C AAAAAC T C T AC T CAT AG AT AG AG AT T T AAC AGAGT T T T C AGAAT T AGAAT AC 416 

Qy 181 T C AGAAAT GGAAT CAT CAT T CAGT G G CT C T CAAAAGG C AGAAC CT G C C GTAAC AGTAG C G 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I II I I I I I I I I I I I I 
Db 417 T C AGAAAT GGGAT CAT C GTT CAGT GT CT CT CCAAAAGC AGAAT C T GCC GTAATAGTAGC A 476 

Qy 241 AAT C CT AGG GAC GAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAG GAC T T AGT T AGT C T T 3 00 

I I I I I I M I I I I I I I I I I MM I II MM II I II I II I I I I II II I I I 
Db 47 7 AAT C C T AG GGAAGAAAT AAT C GT GAAAAATAAAGAT GAAGAAGAGAAGT TAGT T AGTAAT 53 6 

Qy 301 AACAT C CT T CAT ACT C AG C AGGAGT TAT CTACAGT C CTT AC GAAAT CAGT T GAAGAAGAA 360 

I M I I II II I II I II I II II M II I II I M II I I I I II I I II I II I I I 
Db 537 AACAT C CT T CAT AAT CAAC AAG AGT T AC C T ACAG C T C TT AC TAAAT T G GT TAAAGAGGAT 596 

Qy 361 GAT AGAGT T C T GT CT C C AGAAAAAAC AAAGGAC AGT T TT AAG GAAAAG G GAGT T GCAG C A 42 0 

M I I II II I II I I I II M I I I I I I II I I I II I I I II II II II II I I I M 



Db 597 GA AGT T GT GT CT T CAGAAAAAG C AAAAGACAGT T T TAAT GAAAAGAGAGT T G C AGT G 653 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480 

I I I I I I Mill I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I | | I I | I | | | | | | | | 
Db 654 GAAG CT C CT AT GAGGGAG GAAT AT G CAGAC T T C AAAC CAT T T GAGC GAGT AT G G GAAGT G 713 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540

I I I I I I I I Ml I I I I I I I I I I I I Ml I I I I II I I I I I I II I I I I I | I | 
Db 714 AAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCT GGAGGTAAAAT C G AGAG C AAC 770 

Qy 541 T T G GAAGGT AAAGT GGAT AAGAAAC ACT T T T C AGAT AG C CT T GAAC AAAC AAAT C GT GAA 600 

I I I I M I I I I I I I I I I I I I III III II M I I I I I | | || | | | | | | | | | Ml 
Db 771 T T G GAAAGTAAAGT GGAT AAAAAAT GT T T T GC AGAT AG C CT T GAGC AAAC TAAT C AC GAA 830 

Qy 601 AAAGATAGT GAAAGCAGTAAT GAT GACACT T CATTT CC CAGTACACC AGAAGCT GTAAGA 660 

I I I I I M I I I I II I I I I I I I I I II I I I II II I I I I II II I I I I I I I I IN 
Db 831 AAAGATAGT GAGAGT AGTAAT GAT GAT AC TTCTTTCCC CAGT AC GCC AGAAG GT AT AAAG 890 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AAC T G AGAAT G T T 717 

I I I I I I Mill M II I I I I II II I I II I || I I I | || | | | || | M | || 
Db 891 GAT C GT T C AGGAG CAT AT AT C AC AT GT GCT C C C T T T AAC C C AGC AGCAAC T GAGAG CAT T 950 

Qy 718 T CAACAAACAT TTTTCCCTTGTTG GAAGAT CAT ACT T C GGAAAATAAGAC AGAT GAAAAA 777 

M I I I M I I I I I II II II I I I I I I I I I I I M II M I II II I II I M I II I I I I 
Db 951 GCAAC AAACAT TT T T C CT T T GT TAG GAGAT C C TACT T CAGAAAATAAGAC C GAT GAAAAA 1010 

Qy 778 AAGAT AGAA- AAAAAAAGGCACAAATT GTAAC AGAGAAGAAT GCAAGT GT CAAGAC AT C A 836 

M M 11 M MM Mill I II I I II I I I II II I II II | | || Ml MUM 

Db 1011 AAAATAGAAGAAAAGAAGGC CCAAATAGTAACAGAGAAGAATACTAGCAC CAAAACAT C A 1070 

Qy 837 AACC C T T T C C T TAT GG CAG CAC AGGAGT CT AAGAC AGATT AC GT TACAAC AGAT CAT GT G 8 96 

M M M M M I I I I M I II M II II I I II II I II II II I II I I II M II I 
Db 1071 AACC CTTTTCTT GT AG CAG CAC AGGAT T CT GAGAC AGATT AT GT CACAAC AGAT AAT T T A 1130 

Qy 8 97 T CAAAGGT GAC C GAG GAAGT AGT G GCAAAC AT GC C T GAAGGT CT AAC C C C AGAT T T GGT T 956 

M M II II II II II I II I I I II I I II || I I I I M I | | | || || | | | | | | | | M 
Db 1131 ACAAAG GT GAC T GAGGAAGT C GT G GCAAAC AT G C C T GAAGGC C T GAC T C C AGAT T TAGT A 1190 

Qy 957 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAG C TAC T G GT ACAAAAAT T GC C T T T GAAAC A 1016 

M II I I I I II I I I I I I I I I I II II I I I I I | I I I I I I M II II I I I I I I I M I I || I 
Db 1191 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAGT TAC T G GT ACAAAGAT T GCT TAT GAAAC A 1250 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

M I M M M I II II II I M I II I II I I I I I I I II I I I I I || | | | | | | | | | | 
Db 1251 AAAAT G GACT T G GTT CAAAC AT C AGAAGT T AT GC AAGAGT C ACT CT AT C CT G CAG CAC AG 1310 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

M M M M II I I II II I I I II II I I I II II I I II M I I II I II II I I I II I I II 
Db 1311 CTTT GCC CAT CATTT GAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATT GTT 1370 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

M II I I II II II II I I I II II II II I I I I II I II I II I I I I | | | | | | I Ml 
Db 1371 AT GGAAG CAC CAT T GAAT T CT GC AGT T C C TAGT GCTGGTGCTTCCGT GAT AC AG C C CAG C 1430 

Qy 1197 T CAT CAC CAT T AGAAACT C T T C C T T CAGT TAAT TAT GAAAG CAT AAAGT T T GAG C CT GAA 12 56 

I M M M II I II I M I I I I I I II II II II II II I I II I II II I I I I II II II 
Db 1431 T CAT CAC CAT T AGAAGC TTCTT CAGT TAAT TAT GAAAGCATAAAACAT GAG CCT GAA 14 87 



Qy 1257 AAT C C C C CAC CAT AT GAG GAG GC CAT GAAT GT AT C ACTAAAAAAAGAAT C AGGAAT GAAT 1316 

M I I I I I I I I I II I I I I I I I I I I I I I II I I I I II I I I I || I || II I I I I I I I II 
Db 14 88 AAC C C C C CAC CAT AT GAAGAG GC CAT GAGT GT AT CAC TAAAAAAAGT AT C AG GAAT AAAG 1547 

Qy 1317 GAAGAAATCAC AGAGC CT GAAGGTATT AGT GTAGCT GTT CAGGAAACAGAAGCT C CT TAT 137 6 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I | | M I III I I I I I I I I I I 

Db 154 8 GAAGAAATTAAAGAGCCT GAAAATATTAAT GCAGCT CTT CAAGAAACAGAAGCT C CTT AT 1607 

Qy 1377 ATAT CTATT GCAT GT GATTTAATTAAAGAAACAAAGAT CT CT ACT GAACCGACT C CAGAT 1436 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I || I III I I I I I II I I I I IN 
Db 1608 ATAT C TAT T G CAT GT GAT T T AAT TAAAGAAACAAAG CTTTCTGCT GAAC C AG CT C C GGAT 1667 

Qy 1437 TT C T CT AGT TAT T C AGAAAT AG C AGAAGT T GCACAG C C AGT GC CC GAGC AT T CT GAGCT A 14 96 

I M I I I I I I I I I I I I I I I III I I I I I I I I I I I I I || I || || I I | | | | || | | | | 
Db 1668 TT CT C T GAT TAT T C AGAAAT GGCAAAAGT T GAAC AGC C AGT G C CT GAT CAT T CT GAGCT A 172 7 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T CT GAAC C AGT T GAC T TAT T T AGT GAT GAT T CAATAC C C 1556 

M I I I I I I M I I I I II I I I I I I I I I I II I I II I I I I I I I II I I I I I I I I II I II I I I 
Db 1728 GT T GAAGAT T C C T C AC CT GAT T C T GAAC C AGT T GAC T TAT T T AGT GAT GAT T CAATAC CT 1787 

Qy 1557 GAAGT T C CACAAAAACAAGAT GAAG C T GT AAT AC T T GT GAAAGAAAAC CT CAC T GAAAT T 1616 

I I I I I I I I I I I I I I I I I I I I I M II I I II I I I I I I I I I I I II I I I I I II I I I 
Db 1788 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 1847

Qy 1617 T CAT CT GAGT CAAT GAC AGGAC AT GACAAT AAGGGAAAACT C AGT GCT T CAC CAT CAC CT 1676 

I I I I I I I I I I I I I 1 I II I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I 
Db 1848 T CAT T T GAGT CAAT GAT AGAAT AT GAAAATAAG GAAAAACT C AGT GCT T T G C CACCT 1904 

Qy 1677 GAGG GAG GAAAAC C GT AT T T G GAGT CT T T T C AGC C C AGT T T AGGC AT C ACAAAAGAT AC C 173 6 

I I I I I I I M I I II I I II I I I I IIM 1 III I I I I I I I I I I I I I I I I | | || || 
Db 1905 GAGGGAGGAAAGC C AT AT TT GGAAT C T T T T AAGCT C AGTT T AGAT AAC AC AAAAGAT AC C 1964 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796

I I I I I I I M I I I M I I I I II I I I I I I I I I I I II I II I I I I I I I I II I II I I I 
Db 1965 CT GT T AC CT GAT GAAGT T T CAAC ATT GAGCAAAAAG GAGAAAAT T C CTT T GC AGAT GGAG 2 024 

Qy 17 97 GAGCT CAAT ACT GC AGT T TATT CAAGT GATG GC T TAT T CAT T GC T C AGGAAGCAAACCT A 1856 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I II 
Db 2 025 GAGC T C AGT AC T G C AGT T T ATT CAAAT GAT GACT T AT T TAT T T C T AAGGAAGC AC AGAT A 2 084 

Qy 1857 AGAGAAAGT GAAAC AT T T T CAGAT T CAT CT C C GAT T GAGAT TAT AGAT GAGT T C C C GAC C 1916 

I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I II 
Db 2 0 85 AGAGAAACT GAAAC GT T T T CAGAT T CAT C T C CAAT T GAAAT TAT AGAT GAGT T C C CT AC A 2144 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

M M I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I I I I I I I I II 
Db 2145 T T GAT C AGT T C T AAAACT GAT T CAT T T T CTAAAT T AGC C AGGGAAT AT AC T GAC CTAGAA 22 04 

Qy 1977 GT AGC C C AC AAAAGT GAAAT T G CT GAC AT C C AGGAT G GAG C T G G GT CAT TGGCTTGTG C A 2 036 

Ml I I II I I I I I I I II I I I I I I I I II I I I I I I II I I I I I I I I II I I I I I II 
Db 22 05 GT AT C C C ACAAAAGT GAAAT T GC T AAT G C C C CG GAT GGAG CT GGGT CAT T G C C T T G CAC A 2264 

Qy 2037 G GAT T G C C C CAT GAC CTTTCTTT C AAGAGT AT AC AAC C TAAAGAGGAAGT T CAT 2090 

I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 22 65 GAAT T GC C C CAT GAC CTTTCTTT GAAGAACAT ACAAC C CAAAGT T GAAGAGAAAAT C AGT 2324 



Qy 2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I I I I I I I I I Ml II II II I I I I I I I I I I | | || | | M | | || 
Db 2325 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2384 

Qy 2151 GAT GTTTCTGCTTTG GAT G C T CAAGCAGAGAT AGG C AGC AT AGAAAAAC C CAAAGT T C T T 2210 

I I I I I I M I I I I II I I I I I II II II I I I I I I I I II I I I I || I I I I I I I I | | | | 
Db 2385 GAT GT T TCTGCTTTGGC C AC T CAAGCAGAGAT AGAGAGC AT AGT T AAAC C CAAAGT T CT T 2444 

Qy 2211 GT GAAAGAAG C C GAGAGAAAACT T C C T T C T GATAC AGAAAAAGAGC GAAGAT CT C CAT C T 2270 

I I I I I I I I M I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I | I I I 

Db 24 45 GT GAAAGAAGC T GAGAAAAAACT T C C T T C C GATAC AGAAAAAGAGGACAGAT CAC CAT CT 2504 

Qy 2271 G CT AT ATT T T CAGC AGAGCT GAGT AAAAC TT C AGT T GT T GAC CT C CT CTAC T GGAGAGAC 2330 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I || | | | | | | | | | M | | | | | | | | | | 
Db 2505 G C T AT ATTT T CAGC AGAGCT GAGT AAAAC T T C AGT T GT T GAC C T C CT GT AC T GGAGAGAC 2564 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I M I I II I I I II I I I I I I I I I I I I I I I I | | | | | | I I | | | | | | | | I II I I I I I I I I 
Db 2565 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 2624 

Qy 2391 TT CAGCATTGT GAGT GTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGT GAC TAT CAGC 2450 

I I I I I I I M I I I I I I I I I I I I || I I I I || I || | | | || | | | | | || M II I I I MINI 
Db 2 625 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 2684 

Qy 2451 T T T AGGAT AT AT AAG G GT GT GAT C CAGGCTAT C C AGAAAT CT GAT GAAGG C CAC C CAT T C 2510 

M I I II I I I I I I I I I II I I I I M I I | | | | | | | | | | | | || | | | | | M I II I I I I I I I I 
Db 2685 T T T AGGAT AT ACAAGGGT GT GAT C CAAGCT AT C C AGAAAT C AGAT GAAGG C CAC C CAT T C 274 4 

Qy 2511 AG GGC AT AT T T G GAAT CT GAAGT T GCT AT AT C T GAG GAGTT G GT T C AGAAGT ACAGC AAT 2570 

I M I I I I I I M I I I I I I II I I I I I I I I I I I I I | | | | | | | | | | | | | | M II II I II III 
Db 2 745 AG G GC AT AT C T G GAAT C T GAAGT T GC T AT AT C T GAG GAGTT GGTT C AGAAGT AC AGT AAT 2804 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I I I I I I I I I I I I I I Mill I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 2805 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2 864 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I I M I I I I I I I I I I I I I I I I 1 I I II I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I 
Db 2865 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 292 4 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I 
Db 2925 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 2 98 4 

Qy 2751 AT T TAT GAAC GGCAT CAGGC GCAAATAGAT CATTAT CT GGGACTT GCAAATAAGAATGTT 2 810 

I I I I I I I I I I I I I I I M I I I M I II I II I I I I I I II II I II I I I I I I I I I I I II I I I 
Db 2985 AT T TAT GAAC G G CAT C AG G CAC AGAT AGAT CAT TAT CT AG GAC T T GCAAAT AAGAAT GT T 3 04 4 

Qy 2811 AAAGAT GC T AT GGC T AAAAT CCAAGCAAAAAT C C CT G GAT T GAAG C GTAAAGCT GAAT GA 2870 

I I I I I I I I I I I M M I I I I I I I I I II I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I 
Db 3045 AAAGAT GC T AT GGCTAAAAT C CAAG CAAAAAT C C C T G GAT T GAAGC GCAAAG C T GAAT GA 3104 

Qy 2871 GAAAG C C T GAAAGAGT T AACAAT AGAGGAGT T TAT C T T TAAAG G GGAT AT T CAT T T GAT T 2 93 0 

> I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 3105 AAAC G C C C AAAAT AAT T A GT AG GAGT T CAT CT T TAAAG G GGAT ATT CAT T T GAT T 3159 

Qy 2 931 C CAT T G G G GAG GGT C AGG GAAGAACAAAG C C T T GAC AT T G CAGT G C AGT TT C AC 2 984 



Db 3160 AT AC GGGG GAGGGT C AGGGAAGAAC GAA- C C T T GAC GT T G C AGT GC AGT T T C AC AGAT C G 3218 

Qy 2985 AGAT C T T T AT TT T TAG CAAC GCAGT G - T CT GAGGAAAAAT GAC C T GT C T T GACT G 3038 

I I I I I I I I I I I I I I I I I I III II I I I I II I I I I I I I I I II II I I I I I I I 
Db 3219 T TGTTAGATCTTTATTTTTAGC CAT GCACTGTTGT GAGGAAAAAT TACCTGTCTT GACT G 3278 

Qy 3039 CCCTGTGTT CAT CAT C T TAAGT AT T GT AAG CT GC T AT GT AT G GAT T T AAAT C GT AAT CAT 3098 

M I I I I I I I I I I I I I I M I I I M I I I I II I I I I I I I I I I I II I I I I I I I I I I I II I II 
Db 327 9 C CAT GT GT T CAT CAT C T TAAGT AT T GT AAGCT G C T AT GT AT G GAT T T AAAC C GTAAT CAT 3338 

Qy 3099 AT T T GT T T T T C CT GT AT GAGGC AC T GGT GAATAAACAAAGAT CT GAGAAAGCT GT AT AT T 3158 

I I I I I I I I I I I I I II I I I II I II I I I I I I I I I I I I I I I I 

Db 3339 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC CT GT AT AT T 3385 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT— - 3214 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I || I I I | 
Db 3386 T T ACT T T GT T G CAGAT AGT C T T GC C GCAT CT T G G C AAGT T G C AGAGAT G GT G GAG CT AGA 3445 

Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATG 3271 

I I I I I I I I II II I II I I I | | | | | | | | || | M | | | || | | M | | | | 
Db 3446 AAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATG 3505 

Qy 3272 CAGATTTTCTGAAATGAAA T GT T T AGAC GAG AT CAT G C C AC C AAG G C AG G AGT GAA 3327 

I I I 1 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II II I I M I I III 

Db 350 6 CAGAT T T T CT GAAAT GAAAT GT TT GT T T AGAC GAGAT CAT AC C GGT AAAGCAG GAAT GAC 35 65 

Qy 3328 AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG 3387 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3566 AAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTG 3624 

Qy 338 8 C CAAT AT AAGTAAAT AT AGAT TAT AT AT AT CT AT AT AT AGT GT T T C AC GAAGC TT AG C C C 3447 

M I I I I I I I M I I I I I I I I I I | | | | Ml I I I I I | | | I I I I I I II I I I I I II 

Db 3625 C CAAT AT AAGT AAAT AT AGAT TATA TAT GT AT AGT GTTTCACAAAGCTTAGACC 367 8 

Qy 344 8 T T T AC CT T C C C AGCT GC C C C AC AGT G C T T GAT ACT TCTGTCATGGGTTTTATGT 3501 

I M I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3679 T T T AC CT T - C CAGC C AC C C C ACAGT GCT T GAT AT T T CAGAGT C AGT CAT T GGTT ATAC AT 3737 

Qy 3502 GT GTAGT C C CAAAG C AC AT AAG CT AG G GAGAAAC GT ACT T C TAG G C G CACT AC CAT C T GT 3561 

I I I I I M I I I I I I II I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I M I 

Db 3738 GT GTAGT T C CAAAGC ACATAAG CT AGAAGAAGAAAT AT T T CT AG GAGCACT AC CAT CT GT 3797 

Qy 3562 T T T CAACAC GAAC C GAC G C CAT G C AAAC AGAAC T C C - T CAAC AT AAAC T T C AC T GCAC AG 3620 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3798 T T T CAAC AT GAA AT G C C AC ACAC AT AGAACT C CAACAAC AT CAAT T T CAT T GCAC AG 38 54 

Qy 3621 AC T T AC T GTAGT T AAT T T TAT C AC — AAACT CT GGACT GAAT CTAAT GCTT CCAAAAA — 3676 

Ml I II I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I || I I I I I I I I I I I I 
Db 3855 ACT GACT GTAGT T AAT T T T GT C ACAGAAT C TAT GGAC T GAAT CTAAT G C T T C CAAAAAT G 3914 

Qy 3677 T GTTT GCAAATAT CAAACATT GT TAT GT AAGAAAAT AT AAAT GAC G A 3723 

I I I I I I M I I I I I M | Ml I I I I I I II 

Db 3915 TTGTTT GTTT GCAAATAT CAAACATT GTT AT GCAAGAAAT TATTAATTACAAAAT GAAGA 3974 

Qy 3724 T TT ATACAAT T GT G GT TTAAGCTGT ATT GAAC TAAATCTGTGGAATGC AT TGT GAAC TGT 3783 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I | I I I I I I | | | | | | | | | M I I || 



Db 3975 T T T AT ACC AT T GT GGT T T AAG C T GT AC T GAACTAAAT C T GT G GAAT GC AT T GT GAACT GT 4034



Qy 37 84 AAAAGCAAAGTAT CAATAAAGCTTATAGACTTAAAA 3819 

I I I I I I I I I I i I II I I I I I I II I I I I I I I I I i II I I 
Db 4 035 AAAAGCAAAGTAT CAAT AAAG C T T AT AGACT T AAAA 4 07 0 
ORIGIN



Query Match 70.9%; Score 2716; DB 9; Length 4102;

Best Local Similarity 86.0%; Pred. No. 0; Gaps 25;

Matches 3332; Conservative 0; Mismatches 

1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

6! GTACTGCCCACTGAAGGAACACTTCC^GCAACTTCA^^ 120 

Qy 121 gagaaggcaaaaaatccatttgtagagagaaat^ 180 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGC^GAACCTGCCGTAACA^ 240 

2,1 RATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGRAGAGGACTTAGTTAGTCTT 300 
301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

0y 361 „C ; G,C r CAGAA r C^G = ^ «« 

629 GA agTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 685 

421 GAAGCTTCTATGGGGGAGGAATAIGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4B0 

: :: ====== - 



481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATA^ 540 

i I I l l M I III I I I I I I I I I I I I HI M M M I I I I I I II I M M I I I 

746 AAAGATAGT jJ^GGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 802 

TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGA 

Ml Ml I I I II I I I I I I I I Ml Ml M III MM MM I I Ml MM I [[ 



541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

i I I l II I II II II II II II Ml Ml I I II II M II I M I II I I MM I M 

803 TTGGAAAGTAAAGTGGM 862 

601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTC^ ^60 
l I I l I I II II I M M II I II M M II M I M I I II II I I II I M M I II I 

863 AAAGATAGT GAG AGT A 922 
661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC---CAACAACTGAGAATGTT 717 

I I | I I I I | I I I II Mill II I II I I I M II I I M M II II I M II M 

923 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 



718 



TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 



982 
777 



, , I I i| I M | | | | | I I II I I II I M I M I M II II M MM 

983 GC AAC AAAC AT TTTTCCTTTGT T AGG AGAT C C TACT T CAG AAAAT AAGAC CGAT GAAAAA 1042 



836 



778 AAGAT AGAA- AAAAAAAGG C AC AAAT T GT AAC AGAGAAGAAT GC AAGT GT C AAGAC AT C A 

II 11,11 MM IMM I I II I II II I M II I M I II I M M I II M II 

1043 AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 1102 



837 



1103 



AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896 

M i I I l l I III I I I II I M M II I M I M II M I II II II II I M I I II I 
aaccctUtcttgtagc^ us 



897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I l I I I I II I II II II M M M M M M I II II II I II M M M II M II I I 

ACAAAGGTGA^^ 1222 
CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

i i II l l I I II II II II I II II M M I M M I I I II II II M M Mill I II I M M 

CAGGAAGC AT GT GAAAGT GAAT T GAAT GAAGT T ACT GGT ACAAAGAT T GCTT AT G AAAC A 1282 



1163 



957 



1223 

1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

i i i i I I l M I M II II II I I M II II I MM II M II M II II II I I I I M 

AAAATGGACTTGGTTCA 1342 
CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

i I il II II 1 I I I II I M I I II II II I I I M I I Mill I II II M II II M II M 

CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1402 



1283 



1077 



1343 



1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCA.GT 1196 
14 03 ATGGAAGCACCATTGAATTCTGC 

1256 



1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 

. i i i I i i i i i i i i I I | | | I || | || M || || I I M I M II I M II II M I 

1463 T CAT C AC C ATT AGAAGC T T CTT C AGTT AAT TAT GAAAG C AT AAAAC AT GAG C CT GAA 1519 



Ml | | | | | I I II M I II M II I I M II I II I M II 

: AT C AC C ATT AGAAGC T T CT T C AGTT AAT TAT GAAAG CAT AAA 

1257 AATCCCCCACCATATGAGGAGGCCATGAATGT^ 1316 

ii i i i i i i i I I I I I I I I M I t I I I II I M I I 1 I II I I M M i M 

152 o JJ^^iiii^^ 1579 



1317 
1580 
1377 
1640 
1437 
1700 
1497 
1760 



GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 137 6 

I 1 I I I I I I | I I I I I I I M I IIMI M I I I I I''' I I I I I I I I N II I I I I I I 
GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 1639 



ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

| | | | | | | | I I I M I M I I I I I I I I I I I I I I I I M I I I Ml I M I I I I I M I HI 
ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 



TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 

I I I I I | | l | | I I I I I I I I III MINI I I I I I I I t I I I I M I I I I I I I I I I I I 

TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 
GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 

| | | | I M I I I I I I I II I I I I I I I I I I I I I I I I I I M I M I I I I M I M I I I I I I I I I 

GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 1819 



1699 
1496 
1759 
1556 



1557 GAAGTT CCACAAAAACAAGATGAAGCTGTAATACTT GTGAAAGAAAACCTCACTGAAATT 

I MINI I I Mil M I II II I MM I II MIMMI I I 

GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 



1820 
1617 
1880 
1677 
1937 
1737 
1997 
1797 
2057 
1857 
2117 
1917 
2177 
1977 



1616 
1879 
1676 



TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 

I I I I I I I I I II I I I I II I MM I II M I I I I I I M II I M I I I I I I M I 

TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC— CACCT 1936 
GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 

I I M I || II I I MM MUM I MM I M MMMIMI I 

GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 
TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 

I m | | || || | || | I I II II I I I I I I I I M II Ml M I M II II M I I I II I I 

CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 
GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 

MM||| in | | || M I II Ml M I II I I I I M II Ml II MIMMI I I I 

GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 
AGAGAAAGT G AAAC ATT T T C AGAT T CAT CT C C GAT T GAG AT T AT AGAT G AGTT C C C GAC C 

I Mill II Ml II MM I Mill III III III MIMMI II 

AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2176 



1736 
1996 
1796 
2056 
1856 
2116 
1916 



TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 
II | | M II II M I I Mill I I Ml I II I II I II II II I I M II II I I M I I 

TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2236 



GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

M | 1 I M III II I I I I I II II M I M II II I II I I M M I I M II I I I I M 
2237 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2296 

2037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2090 

| | I I I II I M I I M M I I M M I I M MIMMI I I I I M M I I 

GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2356 



2297 
2091 
2357 



GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

M Ml II II I I II I M II M M I II I II I I I M M I M II II M 

TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2416 
2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 



2417 



I 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 N 

GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2476 



2211 GT GAAAGAAGCCGAGAGAAAACTT C CT T CT GAT ACAGAAAAAGAGCGAAGATCT CCAT CT 

I I | | | | | | M | M I I I I I I I I II I I I I I I I M I II I I M I M I I I I I I I I I I I 
OTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 



2477 



2270 
2536 
2330 



2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 

MM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

2537 GCTATATTTTCAGCAGAGCTGAGTAAAACTT CAGTTGTTGACCT CCT GTACTGGAGAGAC 2596 



2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 

M | || I I I I I M I I II I I I M I I I II I I I I I I I I I I I I I II II 

2597 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 

2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 

M | | || | M I I I I I I I I I I I I M I I I I I I M II I I I M II I M I I M I I I I I I I I I I 
2657 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 



2390 
2656 
2450 
2716 
2510 



2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 

| I | | | | | | I | I I II I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 2776 



2717 
2511 
2777 
2571 
2837 



2570 



AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 

Mill I III MINI I IIIIIIIMIIIIIIIIIIIIIIIII IN 

AGGG CAT AT CT GGAAT C T GAAGTT GCT ATAT CT GAGGAGT T GGTT C AGAAGT ACAGT AAT 2836 



TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 

I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I l|NII!|iii!iiiIii 



2630 



TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2 8 96 

2690 
2956 



2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 

I M I I I I I II I M I I I M I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I M I II I I I I I 
2897 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 



2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 

I M |l II M M I I II M I I I I I I I I I I I I I M I I M I I I II I I I I I I I I I I I M I I I 
2957 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 

2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 

M | M II I I M I I I I I M I I I I I I I I I I II I I I I M I I I I I I I I M I I 

3017 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 



2750 
3016 
2810 
3076 
2870 



2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 

M II I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I 
3077 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3136 



2871 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 

M | | | | | I I III I I I I II I II I I I I I I I I I I I I I I I I I I I I 

3137 AAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 



2930 
3191 
2984 



2931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC 

I I I I I I I I II I I I I I I I M II II I I I I I I I I I I I I I I I I I I I I I II I 
3192 ATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3250 

2985 AGATCTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTG 3038 

II MINIUM I Ml M I IMIMMIM I I I I I I M 



51 T T GT T AGAT C T T TAT T T T T AGC C AT G C ACT GT T GT GAGGAAAAAT T AC CT GT CT T GAC T G 3310 



3039 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 

M || Ml || Mill I II III 11 MUM II Ml II I II IN M HUM I III II I II 

3311 CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 



3098 
3370 
3158 



30 99 AT TTGTTTTTCCT GT AT GAG G CAC T GGT GAAT AAACAAAG AT CT G AGAAAGCT GT AT AT T 

MM I II III II I II I II I I 1 111 I I I I I I I It 

3371 A— TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATT 3417 



3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

|| I II II II I I I I II M I II I I I I I I I I I I I I I I I I 

T TACT T T GT T G CAGAT AGT CT T G C C G CAT CT T GG CAAGT T GC AGAGAT G GT GGAG C T AGA 



3418 



3477 



3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATG 3271 

I | | M M I II I I M I I M I I I I I I I I M II I I I M I I I I I M II 

AAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATG 



3478 



3537 



3272 CAGAT T TT CT GAAAT GAAA T GTTT AGAC GAGAT CAT GCCAC CAAGGCAGGAGT GAA 3327 

| || I I I M II I I I I M M I I I I I 1 II I I I I I I 11 I I II M I IN 

CAGAT T T T CT GAAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C GGT AAAGC AGGAAT GAC 



3538 
3328 



3597 
3387 



AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG 

|| || M I I II I II I II I I I M I I I I I I I MM II M M I M 

3598 AAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTG 3656 



C CAAT AT AAGT AAAT AT AGAT TAT AT AT AT C TAT AT AT AGT GT T T CAC GAAGCT T AGC C C 

| | | | | I 1 I I I I I i I I I i 1 I t t I I I I Ml I I II I I II I I I M MM 



3447 



3388 

3657 CCAATATAAGTAAATATAGATTATA TAT GTAT AGT GTTT CACAAAGCTT AGAC C 3710 



3448 TTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGT 3501 

. | | M II II II M I II I I II II I M I II I M I I I M II I II M II I 

3711 T T T AC CT T - C C AGC CAC CC CAC AGT G CT T GAT AT T T C AGAGT CAGT CAT T GGT TAT AC AT 3769 

3502 GT GTAGT C CCAAAGCACATAAGCTAGGGAGAAACGTACTT CT AGGCGCACTACCAT CT GT 3561 

|| M I I I M M I I II M I I M M I I I I M I I M I M II I II M I I II I I I 

377 0 GT GT AGT T C CAAAG C ACAT AAG CT AGAAGAAGAAAT AT T T CT AGGAGC ACT AC CAT CT GT 



3829 



3562 T T T C AAC AC GAAC C G AC GC CAT G CAAAC AGAACT C C - T CAACAT AAACT T C ACT G CAC AG 3620 

| | | M I M I M I II I I 1 I I M II II II II I I II II I I I I M M I I I 
3 8 30 T T T CAACAT GAA AT GCCACACACAT AGAACT CC AAC AACATCAATTTCATTGCACAG 3 88 6 



3621 
3887 
3677 
3947 
3724 
4007 



ACTTACTGTAGTTAATTTTATCAC— AAACT CTGGACT GAAT CTAATGCTTCCAAAAA— 367 6 
| M I I I II I II I I I I M I I M I I I M II I I M I I I I I M II I M II I M I I I 
ACT GACT GTAGT TAATTTT GTCACAGAATCT AT GGACT GAAT CTAAT GCT T CCAAAAAT G 3946 

T GT T T GCAAAT AT CAAAC AT T GT TAT GT AAGAAAAT AT AAAT GAC G A 3723 

| | M | || | M II M M II I I I I I I I M M I II I I M MUM II 

TTGTTTGTTT GCAAAT AT CAAAC AT T GT TAT G C AAG AAAT TAT T AAT T AC AAAAT GAAGA 



T T T AT AC AAT T GT G GT T T AAG CT GTAT T G AACT AAAT C T GT G GAAT G CAT T GT GAAC T GT 

I || | | || I I II M II II I I I I I II II I I I II I M I M I I II M I M M II II I 

TTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGT 



4006 
3783 
4066 



37 84 AAAAGCAAAGT AT CAAT AAAGC TT AT AGACTTAAAA 3819 

I | II I I I II I II I II M M M II I II II II II I M I 
4067 AAAAGCAAAGT AT C AAT AAAG CT TAT AG AC T T AAAA 4102 
AY123248 4109 bp mRNA linear PRI 23-SEP-2003

Homo sapiens RTN4 isoform E (RTN4) mRNA, complete cds;
alternatively spliced.
AY123248 

AY123248.1 GI:26800583 

Homo sapiens (human)
Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

1 (bases 1 to 4109) 

Oertle,T., Huber,C., van der Putten,H. and Schwab,M.E.

Genomic structure and functional characterisation of the promoters
of human and mouse nogo/rtn4

J. Mol. Biol. 325 (2), 299-323 (2003) 

22376540 

12488097 

2 (bases 1 to 4109) 
Van der Putten,H. 
Direct Submission 

Submitted (07-MAY-2002) Nervous System Research, Novartis Pharma
Inc., Basel, Switzerland

3 (bases 1 to 4109) 
Oertle,T. and Schwab,M.E.
Direct Submission

Submitted (17-MAY-2002) Brain Research Institute, University of
Zurich and ETH Zurich, Winterthurerstr. 190, Zuerich 8057,
Location/Qualifiers

1..4109

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606"
/chromosome="2"
/map="2p16"
/tissue_type="testis" 

1..4109
/gene="RTN4"
/note="synonym: NOGO"
1..182
/gene="RTN4"
183..3143
/gene="RTN4"
/note="RTN4-E; alternatively spliced"
/codon_start=1
/product="RTN4 isoform E"
/protein_id="AAM64252.1"
/db_xref="GI:26800584"

/translation="MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHE
YLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSP
KAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVVSSEKAK
DSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKC
FADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPL
LGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE
EVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP
SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENP
PPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPD
FSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLT
ETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPL
QMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE
YTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATS
KVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTS
VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ
AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLK
FAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMA
KIQAKIPGLKRKAE"
3'UTR 3144..4109
/gene="RTN4"



ORIGIN 



Query Match 70.9%; Score 2716; DB 9; Length 4109; 

Best Local Similarity 86.0%; Pred. No. 0; 

Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25; 

Qy    1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
        || |||||||||||||||||| |||| ||||||| ||||||||||||| ||||  || ||
Db  276 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 335

Qy   61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120
        ||| | ||||||||||||||||||| || || |   | ||||||||||||||   |||||
Db  336 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 395

Qy  121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180
        ||||||||||||| || | |  |||| ||| |||||||||| ||||||||||| |||||
Db  396 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 455

Qy  181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240
        |||||||||| |||||| ||||||| ||||| ||| |||||| ||||||||| ||||||
Db  456 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 515

Qy  241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300
        ||||||||||| |||||| | ||||  | || |||| |||||||| | |||||||||  |
Db  516 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 575

Qy  301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360
        ||||||||||||| ||| || |||||| ||||||  ||||| ||||  ||| |||| ||
Db  576 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 635

Qy  361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420
        ||   |||| ||||| |||||||| |||| ||||||||||| |||||| |||||||||
Db  636 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 692

Qy  421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480
        |||||| ||||| |||||||||||||||||||||||||||||||||||||||||||||||
Db  693 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 752

Qy  481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540
        ||||||| |   ||| ||||||||||| | |||  |||||||||||| || ||||||||
Db  753 AAAGATAGT---AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 809



541 T T G GAAGGT AAAGT GGAT AAGAAACAC T T T T C AGAT AGC CT T GAACAAACAAAT C GT GAA 600 

| | I I I I 1 I I I 1 M I i I I I I II I I I I I I I I I I I I I I I I I II I I I M 

810 T T G GAAAGT AAAGT GGAT AAAAAAT GT T T T GC AGAT AGC C T T GAG CAAAC T AAT C AC GAA 869 

601 AAAGAT AGT GAAAGCAGTAAT GAT GACACTT CATT T CCC AGT ACACCAGAAGCT GTAAGA 660 

| I I I I I I I I I I II I II I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I M 
87 0 AAAGAT AGT GAGAGT AGT AAT GAT GAT ACT TCTTTCCC C AGT AC G C C AGAAGGT AT AAAG 92 9 

661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

| | | | I I I I I II II I I I I I I M I I I I I II I I I I I I M I I M I I I I I II 
930 GAT C GT T C AGGAGC AT AT AT CAC AT GTGCTCCCTT T AAC C C AGC AG C AACT GAGAG CAT T 989 

718 T CAACAAACAT TTTTCCCTTGTTG GAAGAT C AT ACTT C G GAAAAT AAGAC AGAT GAAAAA 777 

M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I M I i 
990 GC AACAAAC AT TTTTCCTTT GT T AGGAGAT C CT ACTT CAGAAAAT AAGAC C GAT GAAAAA 1049 

77 8 AAGAT AGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGACAT CA 836 

I I I I I I I I I I Mill I I I I I II I I I I I I I I I II IN I I I I I I 

1050 AAAAT AGAAGAAAAGAAG G C C C AAAT AGT AAC AG AGAAGAAT ACT AGC AC C AAAAC AT C A 1109 

837 AAC C CT T T C CT TAT GGCAGC AC AGGAGT CT AAGACAGAT T AC GT T ACAAC AGAT CAT GT G 8 96 

I I I I I I I I Ml I I I II I I I I I M III MINIMI! II I I I II I I I I II I 

1110 AACC CT TT TCT T GTAGCAGCACAGGAT T CT GAGACAGATT AT GT C ACAACAGATAATT T A 1169 
897 T C AAAGGT GAC C GAGGAAGT AGT GG CAAAC AT G C C T GAAG GT C T AAC C C C AGAT T T GGT T 956 

M | I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M M I II II I I M I I I I II 

1170 ACAAAGGT GACT GAGGAAGT C GT G GCAAAC AT GC CT GAAGG C CT GACT C C AGAT T T AGT A 1229 

957 C AGGAAGCAT GT GAAAGT GAAT T GAAT GAAGCT ACT GGT ACAAAAAT T GC CT T T GAAAC A 1016 
| | | I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
1230 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGT TACT G GT ACAAAGAT T GC T TAT GAAAC A 128 9 

1017 AAAAT GGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 107 6 

| | | I | I I I I I I I I I I I I I I I I I I I I I I MM I II I I I I I. M II I I I Mill 
1290 AAAAT GGACT T G GT T CAAAC AT C AGAAGT T AT G CAAGAGT C ACT CT AT C CT GC AG CACAG 1349 

1077 CTTTGCCCATCTTTTGAAGAATCT GAAGCT ACT CCGTCACCGGTTTTGCCTGACATTGTC 1136 

| | | M II II M II I I II II II II II II I I M I I M M II I II I I I I M M II M 
1350 CT T T GC C CAT C ATT T GAAGAGT C AGAAGC T ACT C CT T CAC C AGT T T T G C C T GAC AT T GT T 1409 

1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

|| || || || | | | | I I II II I II I II II I I I I I II II I I I I I I I I I M M II I 
1410 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 14 69 

1197 T CAT CAC CAT T AGAAAC TCTTCCTT C AGT T AAT TAT GAAAGC AT AAAGT T T GAGC C T GAA 1256 

|| | | | | M II I II I I I I II I I II I I II I I M I II I II II I I I I M M II I I I 
1470 T CAT CAC CAT T AG AAGC TT CT T C AGT T AAT TAT GAAAGC AT AAAAC AT GAG C CT GAA 152 6 

1257 AAT C C C C CAC CAT AT GAG GAGG C CAT GAAT GT AT CACT AAAAAAAGAAT C AGGAAT GAAT 1316 

M II II I II I II II II I I I I I I M M I II II I I I I M II II II I I M I II I I M 
1527 AAC C C C C CAC CAT AT GAAGAG G C CAT GAGT GT AT CACT AAAAAAAGT AT C AGGAAT AAAG 1586 

1317 GAAGAAAT CACAGAGCCT GAAGGTATTAGTGTAGCTGTT CAGGAAACAGAAGCTCCTTAT 137 6 

| | M II II I I II I I I I I I I Mill II I II I I II I M I M I II I I I II I M I I 
1587 GAAGAAAT T AAAGAGCCT GAAAAT AT T AAT G CAGCT CT T CAAGAAAC AGAAG CT C C T TAT 1646 

1377 AT AT CT AT T GCAT GT GAT TTAATTAAAGAAAC AAAGAT CT CTACT GAAC CGACT C CAGAT 1436 



1647 AT AT CT AT T G CAT GT GAT T T AAT T AAAGAAACAAAGCT T T CT GC T GAAC C AGC T C C GGAT 1706 

14 37 T T C T C T AGT T AT T CAGAAAT AG C AGAAGT T G C AC AG C C AGT GC C C GAG CAT T CT GAGC T A 1496 

11 I I I I I I I MM Ml MM I 

1707 T T CT CT GAT TAT T CAGAAAT G G C AAAAGT T GAAC AGC C AGT G C CT GAT CAT T C T GAGC T A 1766 

14 97 GT T GAAGATT CCT CC CC C GAT T CT GAAC CAGTT GACTT AT TT AGT GAT GATT CAAT ACC C 1556 

I M M M I M M M M M M M M I M M M M M M M M M I 

1767 GTT GAAGAT T C CT CACCT GATT CT GAAC CAGTT GACTT ATTTAGT GAT GAT T CAAT AC CT 1826 
1557 GAAGT T C C ACAAAAACAAGAT GAAGCT GT AAT ACT T GT GAAAGAAAAC C T CAC T GAAAT T 1616 

M M M M M I M M M I MM I I M M M M M M I M M M M I I 

1827 GAC GT T C C ACAAAAACAAGAT GAAAC T GT GAT GC T T GT GAAAGAAAGT C T C ACT GAGACT 18 86 
1617 T CAT CT GAGT CAAT GACAGGAC AT GACAAT AAGGGAAAAC T CAGT GC T T CAC CAT CAC C T 167 6 

MM M M M I M M M I MM M M M I M M M M M M M I M M I 

1887 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAACT CAGT G CT T T GC CACCT 1943 

1677 GAG GGAGGAAAAC CGT AT T T GGAGT CT T T T C AG C C CAGT T T AGGCAT C ACAAAAG AT AC C 1736 

M M M M MM MM Ml M I M M I I Ml I 

1944 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2 003 
1737 TTAGCACCT GAT GAAGTT T CAGCATT GAC CCAAAAGGAGAAAAT C CCTTT GCAGAT GGAG 17 96 

I M M M M I M M I M M I M I Ml I 

2004 CT GT T AC CT GAT GAAGT T T CAAC AT T GAGC AAAAAGGAGAAAATT CCTTT GCAGAT GGAG 2063 
17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 18 56 

II || M I I I I I M I I M M I M I I Mill I I II M III M II I II II I I II 

2064 GAGCTCAGTACT GCAGTTTATTCAAATGATGACTTATTTATTT CTAAGGAAGCACAGATA 2123 

1857 AGAGAAAGT GAAAC AT T T T C AGAT T CAT CT C C GAT T GAGAT T AT AGAT GAGT T C C C GAC C 1916 

M I I I I I I II I I I I I I I I I I I II I I I I M I Mill I I I M II M I II I I I I I II 
2124 AGAGAAAC T GAAACGT T T T C AGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T C CC T AC A 2183 

1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 197 6 

|| || | M I I I II I I I I I I I I I I I I I I I I I M I I I I I II I I II I I I I I I I M 
2184 TT GAT CAGTT CTAAAACT GATT CATT TT CT AAATTAGC CAGGGAATATACT GACCTAGAA 2243 

1977 GT AG C C CACAAAAGT GAAAT T G CT GAC AT C C AGGAT GGAG C T GGGT CAT TGGCTTGT GC A 2 036 

I || II I I I I I I I I II I II I M II I I II I I I I M I M I I M II II M 

2244 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 23 03 

2 037 G GAT T G C C C CAT GAC CTTTCTTT C AAG A G T AT AC AAC C TAAAGAG GAAGTT CAT 2 0 90 

I M II I II I I II I M I I I I M I I M I I II II I II I I II II II I I 
2304 GAAT T GC C C CAT GAC CTTTCTTT GAAGAAC AT ACAACC CAAAGT T GAAGAGAAAAT CAGT 2363 

2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II M I II II II M M M I I I M I I II I I I M I I II M I I 

2364 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2423 

2151 GAT GTTTCTGCTTTG GAT GC T CAAGC AGAGAT AGGCAG C AT AGAAAAAC C CAAAGT T C T T 2210 

II II I II I II I I I I II II I II II I I M I M I I II I I II II I 

2424 GAT GTTTCTGCTTTGGC C ACT CAAGC AGAGAT AGAGAGC AT AGT TAAAC C CAAAGT T CT T 24 83 

2211 GT GAAAGAAGCCGAGAGAAAACTT C CTT CT GAT ACAGAAAAAGAGCGAAGAT CT C CAT CT 2270 

I I I II II II II MM I II II II II I II II M I I II II M II I Mill MUM 



Db 24 84 GT GAAAGAAG C T GAGAAAAAACT T C C T T C C GAT ACAGAAAAAGAG GAC AGAT C AC CAT CT 2543 

Qy 2271 GCT AT AT T T T C AG C AGAGCT GAGT AAAAC T T C AGT T GTT GAC CT C CT C T AC T GGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2544 G CT AT AT T T T C AG C AGAG CT GAGT AAAAC T T C AGT T GTT GAC CT C C T GT AC T GGAGAGAC 2 603 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I II I I I I I I I I 
Db 2604 AT T AAGAAGACT G GAGT G GT GT TT G GT GC CAG C CTAT TCCTGCTGCTTT CAT T GACAGT A 2663 

Qy 23 91 TTCAGCATTGT GAGT GTAACGGCCTACATTGCCTTGGCCCT GCT CTCTGTGACTATCAGC 2450 

I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I 

Db 2664 T T CAGCAT T GT GAGC GT AACAG CC T AC AT TGCCTTGGCCCTGCTCTCTGT GAC CAT C AGC 2723 

Qy 24 51 T T TAG GAT AT ATAAGGGT GT GAT C CAG GC TAT C C AGAAAT CT GAT GAAGGC C AC C CAT T C 2510 

II I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I II I I I I I I I II I I I I II I I I I 

Db 2724 T T T AGGAT AT ACAAG G GT GT GAT C CAAGCT AT CC AGAAAT C AGAT GAAGGC C AC C CAT T C 27 83 

Qy 2511 AG GG CAT AT T T G GAAT C T GAAGTT G C TAT AT CT GAGGAGT T G GT T CAGAAGT ACAG C AAT 2 570 

I I II I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I Ml 
Db 27 84 AG G G CAT AT CT G GAAT CT GAAGTT G C TAT AT C T GAGGAGT T GGT T CAGAAGT ACAGT AAT 2 843 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 2844 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2903 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2 690 

I I I I I I I I I I I I I I I I I I I I I II I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2904 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2 963 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I 
Db 2964 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3023 

Qy 2751 AT T TAT GAAC GG CAT C AGG C G CAAAT AGAT C ATT AT CT GGGACT T GCAAAT AAGAAT GT T 2810 

I I I I I I I I I I I ! I I I I I I ! I M I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I 
Db 3024 ATTT AT GAACGGCAT C AGG CAC AGAT AGAT CATT AT CTAGGACT T GCAAAT AAGAAT GTT 3083 

Qy 2811 AAAGAT G CT AT GG CT AAAAT C CAAGC AAAAAT C C C T GG AT T GAAGC GTAAAGCT GAAT GA 2 87 0 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 3084 AAAGAT G CT AT GG CT AAAAT C CAAGCAAAAAT C C C T GGAT T GAAGC GC AAAG CT GAAT GA 3143 

Qy 2871 GAAAG C CT GAAAGAGT T AACAAT AGAGGAGT T TAT CT TT AAAGG GGAT AT T CAT T T GAT T 2930 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 3144 AAAC GC C C AAAAT AAT T A GT AGGAGT T CAT CT TTAAAGG GGAT AT T CAT T T GAT T 3198 

Qy 2931 CCAT T G G G GAGG GT C AGGGAAGAAC AAAG C C T T GACATT G CAGT GC AGT T T CAC 2984 

I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3199 ATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3257 

Qy 2985 AGAT C T T TAT T T T T AGCAAC G CAGT G - T C T GAGGAAAAAT GAC C T GT CT T GACT G 3038 

I I I I I I I I I I I I I I I I I I Ml II I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 3258 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3317 

Qy 3039 C C CT GT GT T CAT CAT C T TAAGT AT T GTAAG CT GCT AT GT AT G GAT T T AAAT C GT AAT CAT 3098 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3318 C CAT GT GT T CAT CAT C T TAAGT AT T GTAAGCT G CTAT GT AT GGAT T T AAAC C GT AAT CAT 3377 



Qy 




ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT

| | | | | | 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 

A— TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATT 


3158 


Db 


OO/O 


3424 


Qy 




ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT

| | | || | | II 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 M 1 1 

T T AC T TT GT T GC AGAT AGT CT T G C C G CAT C T T G G C AAGT T GCAGAGAT GGT GGAG C T AGA 


3214 


Db 




3484 


Qy 


3215 


GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATG

I | | | | | t 1 II 1 1 II 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATG 


3271 


Db 




3544 


Qy 


3272 


CAGATTTTCTGAAATGAAATGTTTAGACGAGATCATGCCACCAAGGCAGGAGTGAA
| | | || 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M M 1 1 1 II 1 III 

CAGAT T T T CT GAAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C GGT AAAGC AGGAAT GAC 


3327 


Db 


3545 


3604 


Qy 


o o o o 


AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG

I M 1 1 1 1 1 MM 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 

AAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTG 


3387 


Db 


3605 


3663 


Qy 


3388 


CCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTCACGAAGCTTAGCCC

M 1 II II M II M 1 M 1 1 M 1 II 1 1 1 1 1 Ml 1 M M 

CCAATAT AAGT AAAT AT AGAT TATA TAT GT AT AGT GT T T C ACAAAGCT T AGAC C 


3447 


Db 


3664 


3717 


Qy 


34 48 


TTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGT

M 1 1 1 1 M II II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 

T T T AC CT T - C C AGC C AC C C C AC AGT G CT T GAT ATT T C AGAGT C AGT CAT T G GT TAT AC AT 


3501 


Db 


3718 


3776 


Qy 


3502 


GTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGT

| M II 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M N M 1 1 1 1 M M 1 1 

GTGTAGTTCCAAAGCACATAAGCTAGAAGAAGAAATATTTCTAGGAGCACTACCATCTGT 


3561 


Db 


37 77 


3836 


Qy 


3562 


ttt r a a r a r r a a p p p, a P PtP r AT GP AAACAGAACT CC- T CAACATAAACTT CACT GCACAG 

| | | | M II 1 M 1 M 1 1 1 II 1 M II II II 1 II 1 1 1 1 1 1 M 

TTT CAAC AT GAA AT GC C ACAC ACAT AGAACT C CAACAAC AT CAAT T T CAT T GCACAG 


3620 


Db 


3837 


3893 


Qy 


0/T01 

3o21 


arTTarTTTAPTTAATTTTATPAP — AAA CTCTGGACT GAAT CTAATGCTTCCAAAAA — 


IN || || 1 II M 1 1 M 1 1 MM II II MUM M M II M 1 1 

ACT GACT GT AGT T AAT T T T GT C AC AGAAT CT AT GGACT GAAT CT AAT GCT T C CAAAAAT G 


3676 


Db 


o o n /i 

38 94 


3953 


Qy 


jD / / 


TGTTT GCAAAT AT C AAACAT T GT T AT GT AAGAAAAT AT AAAT GAC GA 

I 1 II 1 1 1 1 1 1 II 1 II II 1 1 1 II 1 1 II 1 II 1 1 1 1 III 1 1 1 1 M M 

T T GT T T GT T T GCAAAT AT CAAACAT T GT T AT GCAAGAAAT TAT T AAT T ACAAAAT GAAGA 


3723 


UD 


-J J *i 


4013 


Qy 


3724 


TTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAAlI Gl 

I Mill II 1 II 1 1 1 II II 1 II 1 M M 1 1 II 

TTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGT 


ft ^ 


Db 


4014 


4073 


Qy 


3784 


AAAAG CAAAGT AT CAAT AAAGCT TAT AGACT TAAAA 3819 

II II M 1 1 1 II II 1 II II 1 II II 1 1 1 1 M 1 M 1 1 1 1 

AAAAG CAAAGT AT CAAT AAAGCT TAT AGACT TAAAA 4109 




Db 


4074 
AY123247
4123 bp mRNA linear PRI 23-SEP-2003
Homo sapiens RTN4 isoform D (RTN4) mRNA, complete cds;
alternatively spliced.

AY123247 

AY123247.1 GI:26800580 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 4123) 

Oertle,T., Huber,C., van der Putten,H. and Schwab,M.E.

Genomic structure and functional characterisation of the promoters 

of human and mouse nogo/rtn4 

J. Mol. Biol. 325 (2), 299-323 (2003) 

22376540 

12488097 

2 (bases 1 to 4123) 
Van der Putten,H. 
Direct Submission 

Submitted (07-MAY-2002) Nervous System Research, Novartis Pharma
Inc., Basel, Switzerland 

3 (bases 1 to 4123) 
Oertle,T. and Schwab,M.E.
Direct Submission 

Submitted (17-MAY-2002) Brain Research Institute, University of
Location/Qualifiers
1..4123
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/chromosome="2"
/map="2p16"
/tissue_type="testis"
1..4123
/gene="RTN4"
/note="synonym: NOGO"
1..196
/gene="RTN4"
197..3157
/gene="RTN4"
/note="RTN4-D; alternatively spliced"
/codon_start=1
/product="RTN4 isoform D"
/protein_id="AAM64251.1"
/db_xref="GI:26800581"
/translation="MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHE
YLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSP
KAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVVSSEKAK
DSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKC
FADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPL
LGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE
EVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP
SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENP
PPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPD
FSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLT
ETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPL
QMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE
YTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATS
KVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTS
VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ
AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLK
FAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMA
KIQAKIPGLKRKAE"
3'UTR 3158..4123
/gene="RTN4"



ORIGIN 



Query Match 70.9%; Score 2716; DB 9; Length 4123; 

Best Local Similarity 86.0%; Pred. No. 0; 

Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25; 
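
The Best Local Similarity figure above can be reproduced from the reported counts. The sketch below assumes that the percentage is the number of matches divided by the total number of alignment columns (matches + conservative + mismatches + indels); this definition is an assumption that happens to agree with the reported 86.0%, not a statement of how the search program computes it. The function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are exact matches (assumed definition)."""
    aligned_columns = matches + conservative + mismatches + indels
    if aligned_columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / aligned_columns


# Counts reported for the AY123247 hit
pct = best_local_similarity(matches=3332, conservative=0, mismatches=445, indels=99)
print(f"{pct:.1f}%")  # prints 86.0%
```

Here 3332 / (3332 + 0 + 445 + 99) = 3332 / 3876, which rounds to 86.0%.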

CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I 1 II M 
CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 349

GT AC T G C C CACT GAAGGAAC ACT T C C AGCAACT T CAAAT GAAG CT T CT AAAG C ATT CT C A 12 0 
Ml I I I I I I I I I I I I I I I I 1 I I I II II I I I I I I I I I I I I I I I I I I I II 
GT AT T AC C CACT GAAG GAAC ACTT CAAGAAAAT GT CAGT GAAGCT T CT AAAGAGGT CT C A 409 

G AGAAGG CAAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T T T CAGAAT T G GAAT AT 180 
I I I I I I I I I I I I I II I I I I I I III I I I I I I II I I I I I I I I I I I I I I I I I I 
GAGAAGGCAAAAACT CT ACT CAT AG AT AGAGAT T T AAC AGAGTT T T CAGAAT T AGAAT AC 469 

TCAGAAATGGAAT CAT CAT T CAGT GGCTCT CAAAAG GC AGAAC CT GCC GT AAC AGTAGC G 24 0 

I | I I I I I I II I I I I I I I I I I I I I Mill III MINI I I I I I I I II I I M I I 

T CAGAAAT G GGAT CAT C GT T CAGT GT C T CT C CAAAAGC AGAAT CT GC C GT AAT AGT AG C A 52 9 

AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 30 0 

I I II I I I I II I I II I I I M II 

AAT C CT AG G GAAGAAAT AAT C GT GAAAAAT AAAGAT G AAGAAG AGAAGT T AGT T AGT AAT 589 

AAC AT C CT T CAT AC T C AG C AGGAGT TAT C T AC AGT C CT T AC GAAAT C AGTT GAAGAAGAA 360 

I I I I I I I I I I I II III II II I I I I I I I I I I I I I I I I I I I I I I I I I I II 
AACAT C CT T CAT AAT CAAC AAGAGT T ACC T AC AG CT C T T ACTAAAT T G GT T AAAG AGGAT 64 9 

GAT AGAGT T CT GT C T C C AGAAAAAACAAAGGACAGT T T T AAG GAAAAGG GAGT T GCAG C A 420 

II I I II I I I I I I M I I I I I I'll I I I I I I I I I I I II I I I I I I I II I I I I 

GA AGT T GT GT CT T C AGAAAAAGCAAAAGAC AGT T T T AAT GAAAAGAG AGT T G CAGT G 706 

GAAGCT T CT AT G GG G GAG GAAT AT G C AGACT T CAAAC CAT T T GAG C GAGT AT GG GAAGT G 4 8 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I 
GAAG C T C C TAT GAGGG AGGAAT AT GC AGAC T T CAAAC CAT T T GAGC GAGT AT GGGAAGT G 766 

AAAGAT ACTTACAAGCAAGAT AGT GAT GTTTT GATT GCT GGAGGTAAT AT AGAGAGCAAA 54 0 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I M I I I I I II I I I I 

AAAGATAGT AAG GAAGAT AGT GAT AT GT T G GCT GC T GGAG GT AAAAT C GAGAGC AAC 823 

T T G GAAG GT AAAGT GGAT AAGAAAC AC T T T T C AGAT AGC CT T GAACAAACAAAT C GT GAA 600 

|| | | II I II I I M II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I Ml 

T T GGAAAGT AAAGT G GAT AAAAAAT GT T T T GC AGAT AG C CT T GAGCAAACT AAT C AC GAA 8 8 3 



Qy 


1 


Db 


290 


Qy 


61 


Db 


350 


Qy 


121 


Db 


410 


Qy 


181 


Db 


470 




9 41 

1 


Db 


530 


Qy 


301 


Db 


590 


Qy 


361 


Db 


650 


Qy 


421 


Db 


707 


Qy 


481 


Db 


767 


Qy 


541 


Db 


824 



Qy 



601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 



I I I I I I I II I I II I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I Ml 

Db 884 AAAGAT AGT GAGAGT AGT AAT GAT GAT ACT T C T T T C C C CAGT AC G C C AGAAG GT AT AAAG 943 

Qy 661 GGT GGT T C C GGAGC GT ACAT CAC GT GT GCT C C CT TT AACC CAAC AACT GAGAAT GT T 717 

I I I I I I I I I I I II Mill I I I I I I I M II I I II I II I II I I I II I II 

Db 944 GAT C GTT CAGGAGCAT ATAT CACAT GT GCT C C CT TTAACCC AGCAGCAACT GAGAGCAT T 1003 

Qy 718 T CAACAAAC AT TTTTCCCTTGT T GGAAGAT CAT ACT T C GGAAAAT AAGAC AGAT GAAAAA 7 77 

II I I I I I I I I II I I II I I I I I I I I I I I I I II II I II I I II I I I I I I I I II I I I 
Db 1004 G CAACAAAC AT TTTTCCTTT GT T AG GAGAT C C T AC T T CAGAAAAT AAGAC C GAT GAAAAA 1063 

Qy 77 8 AAGAT AG AA- AAAAAAAG G C AC AAAT T GT AAC AG AGAAGAAT G C AAGT GT C AAGAC AT C A 836 

II MIMI MM I I M I I I I I I I I I I II M I I M I I I I M II I MUM 

Db 1064 AAAAT AGAAGAAAAGAAGG C C CAAAT AGT AAC AG AGAAGAAT AC T AGC AC CAAAAC AT C A 1123 

Qy 837 AAC C CT T T C CT T AT GGCAGCAC AG GAGT CT AAGACAGAT T AC GT T ACAAC AGAT CAT GT G 896 

I I II I I II III I I II M I I II I I III I II M I II I I II II I II I II I II I 

Db 112 4 AACC CTT TT CTT GTAGCAGCACAGGATT CT GAGACAGATT AT GT CACAACAGATAATTT A 1183 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

II II II II M M I M I I I I I I I I I I I I II II I I I I II I II II I II I II I I I I 
Db 1184 AC AAAG GT GACT GAG GAAGT C GT GG CAAAC AT GC CT GAAGG C CT GACT C C AGAT T T AGT A 1243 

Qy 957 C AG GAAGCAT GT GAAAGT GAAT T GAAT GAAGC TACT GGT AC AAAAAT T G C CT T T GAAAC A 1016 

II II M I II I I II I I I I I II I II I I II I II I I I I I II II II I I Mill I M I I I I I 

Db 1244 CAGGAAGCAT GT GAAAGT GAATTGAAT GAAGTTACT GGTACAAAGATT GCTT ATGAAACA 1303 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I I I II I II I II II M I II I II I II I I I II I I I M II I I I II MM I I I I I I 
Db 1304 AAAAT G G ACT T G GT T CAAACAT CAGAAGT T AT GCAAGAGT C ACT C TAT C CT GC AGCAC AG 1363 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I M II I I II I I I I I II II II I II II I II I I I Mill I I II M M II I II I I M 
Db 1364 CT T T G C C CAT C ATT T GAAGAGT C AGAAGC T ACT C CT T CAC CAGT T T T GC CT GAC AT T GT T 1423 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I II I I M II II I I I I M I I I M I I II I I I II I II I I I MM III 
Db 1424 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1483 

Qy 1197 T CAT CAC CAT T AGAAACT CT T C CT T CAGT T AAT TAT GAAAGC ATAAAGT T T GAG C C T GAA 1256 

I || II I II I M II I I I I M M I II II I I I I I I I II I I II II I I I I I I II I I I 

Db 1484 T CAT CAC CAT T AG AAG C TT CTT CAGTT AAT TAT GAAAGC ATAAAACATGAGCCTGAA 1540 

Qy 1257 AAT C C C C CAC CAT AT GAG GAG GC CAT GAAT GT AT CACT AAAAAAAGAAT C AGGAAT GAAT 1316 

II I I I II I I I II I II I I II II M I M M I I I I I I II I I II II I M I I II II I M 

Db 1541 AACC C CC CAC CAT AT GAAGAGGCCAT GAGT GTAT CACTAAAAAAAGT AT C AGGAAT AAAG 1600 

Qy 1317 GAAGAAAT C AC AGAGC CT GAAG GT AT T AGT GT AGCT GTT C AG GAAAC AGAAG CT C C T TAT 1376 

I I M I I II I I M M II II I I I I I I M I I I I I II I II I II I II M I M II I I I 

Db 1601 GAAGAAAT TAAAGAG CC T GAAAAT ATT AAT GCAGCT CTT CAAGAAAC AGAAGCT CCTTAT 1660 

Qy 1377 ATAT C TAT T GCAT GT GAT T T AAT T AAAGAAACAAAGAT CT CT ACT GAAC C GACT C CAGAT 1436 

II I I I I II I M I M II II I II I II I II II I I I I II I I Ml I I II I I I M M III 

Db 1661 ATAT CT AT T GCAT GT GAT T T AAT T AAAGAAACAAAG CTTTCTGCT GAAC C AG CT C C GGAT 1720 



Qy 1437 T T C T CT AGT TAT T C AGAAAT AG C AGAAGT T GCAC AGC CAGT GC C C GAGCAT T CT G AGC T A 1496 

II || I I I I I I II II M I I Ml M II I I I I I I II I I I I I I II I II I II M II I I 



Db 



1721 T T CT C T GAT TAT T CAGAAAT G G CAAAAGT T GAACAGCC AGT G C C T GAT CAT T CT GAGC T A 178 0 



Qy 1497 GT T GAAGAT TCCTCCCCC GAT T CT GAAC C AGT T GACTT AT T T AGT GAT GAT T CAATAC C C 1556 

II I I I I I I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I 
Db 17 81 GT T GAAGAT T C CT C AC CT GAT T C T GAACCAGT T GAC T TAT T T AGT GAT GAT T CAATAC C T 1840 

Qy 1557 GAAGT T C C ACAAAAAC AAGAT GAAGCT GT AAT AC T T GT GAAAGAAAAC C T C AC T GAAAT T 1616 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 1841 GACGT T CC ACAAAAAC AAGAT GAAACT GT GAT GCTT GT GAAAGAAAGT CT CACT GAGACT 1900 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCTT CAC CAT CACCT 1676 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I 
Db 1901 T CATTT GAGTCAAT GATAGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 1957 

Qy 1677 GAGG GAGGAAAAC C GT AT T T G GAGT CT T T T C AG C C CAGT T T AG GC AT C ACAAAAGAT AC C 1736 

I I I I I I I I I II II MINN I I I I I I III I I I I I I I I I I I II I I I I I I I II 
Db 1958 GAGG GAGGAAAGC C AT AT T T G GAAT CT T T T AAGCT CAGT T T AGAT AAC AC AAAAGAT AC C 2 017 

Qy 1737 T TAG C AC CT GAT GAAGT T T C AGC AT T GAC C CAAAAG GAGAAAAT C CCT T T GC AGAT G GAG 1796 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I II I I I I I I I I I I I I 

Db 2018 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2077 

Qy 1797 GAGCT CAATAC T GCAGT T TAT T CAAGT GAT G GCT T AT T CAT T G CT CAGGAAG CAAAC CT A 1856 

I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I III II II I I I I I I I II 
Db 2078 GAGCT CAGT AC T GCAGT T TAT T C AAAT GAT GAC T TAT T TAT TT CT AAGGAAG CAC AGAT A 2137 

Qy 18 57 AGAGAAAGT GAAACAT T T T C AGAT T CAT C T C C GAT T GAGATT AT AGAT GAGT T C C C GAC C 1916 

I I I I I II I I I II I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 

Db 2138 AGAGAAACT GAAAC GT T T T C AGAT T CAT C TC CAAT T GAAAT TAT AGAT GAGT T C C C T AC A 2197 

Qy 1917 T T T GT CAGT T CTAAAG C AGATT CT T C T C CT ACAT TAGC CAGGGAAT ACACT GAC CTAGAA 197 6 

II I I I I I I I I I I I I Mill I I III I I I I I I I I I I I I I M I II M I I I I I I I 

Db 2198 TT GAT CAGTT CTAAAACT GATT CATTTT CTAAAT TAGC CAGGGAAT AT ACT GACCTAGAA 2257 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I II I I I I I I I I I I I I II 

Db 2258 GT AT C C CACAAAAGT GAAAT T G CT AAT G C CC C G GAT G GAG CT GGGT CAT T G C CTT GCAC A 2317 

Qy 2037 GGAT T GC C C CAT GAC CTTTCTTT CAAGAGT AT ACAAC C TAAAGAG GAAGT T CAT 2090 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I 

Db 2318 GAAT T GC C C CAT GAC CTTTCTTT GAAGAACAT ACAAC C CAAAGT T GAAGAGAAAAT CAGT 2377 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I I I I I II II III II II II I II I I I I I I II I I I I I I I I I I I 

Db 2378 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2437 

Qy 2151 GAT GT T TCTGCTTT GGAT GCT CAAGCAGAGAT AG G CAG C AT AGAAAAAC C C AAAGT T CTT 2210 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I M I I 
Db 2438 GAT GT T TCTGCTTTGGC CACT CAAGCAGAGAT AGAGAG CAT AGT TAAAC C C AAAGT T CTT 2497 

Qy 2211 GT GAAAGAAGC CGAGAGAAAACT T CCTT CTGATACAGAAAAAGAGC GAAGAT CT CCAT CT 227 0 

I I I I I I I I II I I I II I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 24 98 GT GAAAGAAGC T G AGAAAAAACT T C CT T C CG AT AC AGAAAAAGAG GAC AGAT CAC CAT CT 2557 

Qy 2271 G CT AT ATT T T CAG CAGAG CT GAGTAAAAC TT CAGT T GT T G AC CT CCT CT AC T G GAGAGAC 2330 

I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I M 
Db 2558 GCT AT AT T T T CAG C AGAG CT GAGTAAAAC TT CAGT T GT T G AC CT CCT GT AC T G GAGAGAC 2617 



Qy 


2331 


Db 


2618 


Qy 


2391 


Db 


2678 


Qy 


2451 


Db 


2738 


Qy 


2511 


Db 


2798 


Qy 


2571 


Db 


2858 


Qy 


2631 


Db 


2918 


Qy 


2691 


Db 


2978 


Qy 


2751 


Db 


3038 


Qy 


2811 


Db 


3098 


Qy 


2871 


Db 


3158 


Qy 


2931 


Db 


3213 


Qy 


2985 


Db 


3272 


Qy 


3039 


Db 


3332 


Qy 


3099 


Db 


3392 



I I I I I I I I I I M I II I I I I I I II II I I I II I I I I I I I II I I I I I I II I I I I I I I I 



TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I M I I I I I I 
TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 27 37 

T T T AGGAT AT AT AAGG GT GT GAT C CAGG CT AT C C AGAAAT CT GAT GAAG GC CAC C CAT T C 2510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 1 I I I 
TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 2797

AG GGC AT AT TT G GAAT C T GAAGT T G CT AT AT CT GAGGAGT T GGT T C AGAAGT ACAGCAAT 257 0 
I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AG GG CAT AT CT GGAAT CTGAAGTT GCTATATCT GAGGAGT T GGTT C AGAAGT ACAGTAAT 2 8 57 

TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2 630 

M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2 917 

GAT T T AGT T GAT T CT CT GAAGT TT G C AGT GT T GAT GT GGGT AT TT AC CT AT GT T GGTGC C 2 690 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I i I I I I I I I I I I I I I I I I I I I M I I I I 

GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2977 

TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I II I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3037 

ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCT GGGACTT GCAAATAAGAATGTT 2 810 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
ATTT AT GAACGGCAT CAGGCACAGAT AGAT CATTAT CTAGGACTT GCAAATAAGAATGT T 3097 



I I I I I I I I I I I I I II I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I MINIMUM 
AAAGAT GCT AT GGCTAAAAT C CAAG C AAAAAT C C C T GGAT T GAAGC GCAAAGCT GAAT GA 3157 

GAAAG C CT GAAAGAGT T AACAAT AGAG GAGTT T AT CT T T AAAGGG GAT AT T CAT T T GAT T 2 930 

I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I II I I I I I I M I I I I 

AAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3212



C CAT T GGGGAGGGT CAGG GAAGAACAAAGC C T T GACAT T G C AGT GC AGT T T CAC 2984 

I I I I I I I I II I I I I I II I I II II I I I I II I I I I I I I I I I I I I I I I I I 



— AGAT C T T TAT T T T T AGCAAC GC AGT G - T C T GAGGAAAAAT GAC CT GT CT T GACT G 303 8 

I I I I I I I I I I I I I I I I I I III II I I I II M I I I I I I I I I I I I I I I I I I I 
T T GT T AGAT CT T TAT T T T T AGC C AT GC ACT GT T GT GAG GAAAAAT T AC C T GT C T T GACT G 3331 

CCCT GT GTT CAT CAT CT TAAGTATT GTAAGCT GCT AT GTAT GGAT TTAAAT CGTAAT CAT 3 098 

II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

C CAT GT GT T CAT CAT CT TAAGTATT GT AAG CT G CT AT GTAT GGAT T TAAAC C GT AAT CAT 3391 

AT TTGTTTTTCCT GTAT GAG GC ACT GGT GAAT AAACAAAGAT C T GAGAAAGCT GTAT AT T 3158 

! I I I I I I I II I I I I I I I I I I I I I I I | | | | I I I I I II I I I 

A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC CT GTAT AT T 3438 



Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I I I I I t I I I I I I I I I I I I I I II I I II I I I I I I I II I I I I M I I 
Db 3439 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGA 3498 

Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATG 3271 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 3499 AAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATG 3558 

Qy 3272 C AG AT T T T C T G AAAT G AAA T GT T T AGAC GAGAT CAT GC C AC CAAG G C AGG AGT GAA 3327 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I III 

Db 3559 C AGAT T T T CT GAAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C G GT AAAG C AGGAAT GAC 3618 

Qy 3328 AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG 3387 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3619 AAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTG 3677 

Qy 3388 CCAATATAAGTAAATAT AGATTAT AT AT AT CT AT AT AT AGT GTTT CACGAAGCTTAGCC C 3447 

I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II 

Db 3678 CCAATATAAGTAAATAT AGAT TATA TAT GTAT AGT GTTT CACAAAGCTT AGAC C 3731 

Qy 3448 TTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGT 3501 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I 

Db 3732 T T T AC C T T - C C AGC C AC C C C AC AGT G C T T GAT AT T T C AGAGT CAGT CAT T GGT TAT AC AT 3790 

Qy 3502 GTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGT 3561 

I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I M 

Db 3791 GT GT AGT T C CAAAGCACATAAGCT AGAAGAAGAAAT AT T T CT AG GAGCAC T AC CAT CT GT 3850 

Qy 3562 TTT CAACAC GAACCGAC GCCAT GCAAAC AGAACT CC- T CAACATAAACTT CACT GCACAG 3620 

I I II I I I I I I I I I I II I I I II I I M I I I I I I I I I I I I I I I M I I I I 
Db 38 51 TTT C AAC AT GAA AT GCCACACACAT AGAACT CCAACAACAT CAATTT CAT T GCACAG 3907 

Qy 3621 ACT T AC T GT AGT T AAT T T TAT CAC — AAAC T CT G GAC T GAAT CT AAT GCT T C CAAAAA — 3676 

III I I I I I I II I I I I I I I I I I I II II I I I I I I M I I I M I I I I I I I II I I I I 
Db 3908 ACT GACT GT AGT T AAT T T T GT CAC AGAAT C TAT G GAC T GAAT C T AAT GCT T C CAAAAAT G 3 967 

Qy 3677 T GT T T GCAAAT AT CAAAC AT T GT T AT GT AAGAAAAT AT AAAT GAC G A 3723 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III I I I I M M 

Db 3968 TTGTTTGTTTG C AAAT AT CAAAC AT T GT T AT GC AAGAAAT TAT T AAT T AC AAAAT GAAGA 4027 

Qy 3724 T T T AT ACAAT T GT G GTT T AAGC T GTAT T GAACT AAAT CT GT G GAAT GC AT T GT GAACT GT 3783 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 4028 TTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGT 4087 

Qy 3784 AAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAA 3819 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 4088 AAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAA 4123 
source 1..4160 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/chromosome="2" 
/map="2p16" 
/tissue_type="testis" 
gene 1..4160 

/gene="RTN4" 
/gene="RTN4" 
/product="RTN4 isoform Ab" 
/db_xref="GI:26800578" 

/translation="MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHE 
YLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSP 
KAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVVSSEKAK 
DSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKC 
FADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPL 
LGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 
EVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 
SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENP 
PPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPD 
FSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLT 
ETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPL 
QMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE 
YTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATS 
KVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTS 
VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 
AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLK 
FAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMA 
KIQAKIPGLKRKAE" 
3'UTR 3195..4160 

/gene="RTN4" 

ORIGIN 

Query Match 70.9%; Score 2716; DB 9; Length 4160; 

Best Local Similarity 86.0%; Pred. No. 0; 

Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25; 
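
The summary counts above can be checked against the reported Best Local Similarity with a short calculation. The sketch below assumes, since the report itself does not state the formula, that the similarity is the number of matches divided by the total number of aligned columns (matches + mismatches + indels). The helper names `parse_counts` and `local_similarity` are illustrative only and are not part of the search program's output.

```python
import re


def parse_counts(line):
    """Parse 'Name value;' pairs from an alignment summary line into a dict of ints."""
    pairs = re.findall(r"([A-Za-z ]+?)\s+(\d+);", line)
    return {name.strip(): int(value) for name, value in pairs}


def local_similarity(matches, mismatches, indels):
    """Percent identity over all aligned columns (assumed definition, see lead-in)."""
    return 100.0 * matches / (matches + mismatches + indels)


# Summary line copied from the report.
summary = "Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25;"
counts = parse_counts(summary)
similarity = local_similarity(
    counts["Matches"], counts["Mismatches"], counts["Indels"]
)
print(f"{similarity:.1f}%")  # 3332 / 3876 aligned columns
```

Under that assumption the counts give 3332 / (3332 + 445 + 99) = 3332 / 3876, which rounds to the 86.0% shown in the report.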

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I I II II 
Db 327 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 386 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I 
Db 387 GT AT T AC C C AC T GAAGGAAC ACT T CAAGAAAAT GT CAGT GAAGCT T CT AAAGAGGT C T C A 44 6 

Qy 121 GAGAAGGCAAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T T T C AGAAT T G GAAT AT 18 0 

I I I I I I I I I I I I I II I I I I I I III I I I I I I I I II II I I I I I I I II I II I I 
Db 447 GAGAAGGCAAAAACT C TACT CAT AGAT AGAGAT T T AAC AGAGT T T T C AGAAT T AGAAT AC 50 6 

Qy 181 T C AGAAAT G GAAT CAT CAT T CAGT G G C T C T CAAAAGG C AGAAC CT GC C GTAACAGT AGC G 24 0 

I I I I I I I I I I I I I I I I II I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I II I 
Db 507 T C AGAAAT G GGAT CAT C GT T CAGT GT CT CT C CAAAAGC AGAAT C T GC C GT AAT AGT AG C A 566 

Qy 241 AAT C CT AG GGACGAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAGGAC T T AGT T AGT CT T 300 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I 
Db 567 AAT CCT AGGGAAGAAATAAT CGT GAAAAATAAAGAT GAAGAAGAGAAGTTAGTTAGT AAT 626 

Qy 301 AAC AT C C T T CAT ACT C AG C AGGAGT T AT C T AC AGT CCT T AC GAAAT CAGT T GAAGAAGAA 360 

I I I I I I I I I II I I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 627 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 686 

Qy 361 GAT AGAGT T CT GT CT C CAGAAAAAACAAAGGAC AGT T T TAAG GAAAAGG GAGT T GCAGC A 420 

II I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II 

Db 687 GA AGT T GT GT CT T CAGAAAAAGCAAAAGAC AGT T T TAAT GAAAAGAGAGT T GCAGT G 743 

Qy 421 GAAGCT T CT AT GGGGGAGGAAT AT GCAGACT T C AAAC CAT T T GAG C GAGT AT G G GAAGT G 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 744 GAAG CT C CT AT GAGGGAGGAAT AT G C AGACT T CAAAC CAT T T GAG C GAGT AT G GGAAGT G 803 

Qy 481 AAAGAT AC T T ACAAG CAAGAT AGT GAT GT TT T GAT T GCT GGAG GT AAT AT AGAGAGC AAA 540 

I I I I I I I I III I I I I II I II I I I III I II I I I I I I I II II I I I I I I I I 
Db 804 AAAGATAGT AAG GAAGAT AGT GAT AT GT T GGC T GC T GGAGGTAAAAT C GAGAGC AAC 860 

Qy 541 TT GGAAGGTAAAGT GGATAAGAAACACT TTT CAGATAGCCTT GAACAAACAAAT C GTGAA 600 

I I I I I I I I I I I I II I I I I I III III I I I I I I I I I I I I I I I I I I I I I I III 
Db 861 T T GGAAAGTAAAGT G GAT AAAAAAT GT T T T G C AGAT AG CCT T GAG CAAACT AAT C ACGAA 920 

Qy 601 AAAGATAGT GAAAG CAGT AAT GAT GACACT T CAT T T C C C AGT ACAC C AGAAGCT GT AAGA 660 

I I I I I I I I I I I. II I II I I I I I I I I I I I I I II I I I I I I I I II I I I I I I Ml 
Db 921 AAAGATAGT G AGAGTAGT AAT GAT GAT ACT T CT T T C C C CAGT AC G C CAGAAG GT AT AAAG 9 80 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AAC T GAG AAT GT T 717 

I I I I I I I II I I II I I I I I I I II I I II I I II I I II II I II I II II I II 



Db 981 GAT C GT T C AG GAG CAT AT AT C ACAT GTGCTCCCTT T AAC C C AGC AG C AACT GAGAGC AT T 104 0 

Qy 718 T CAACAAACATT T T T CCCTT GT T GGAAGAT CAT ACTT CGGAAAATAAGACAGAT GAAAAA 777 

I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1041 GCAACAAACAT TTTTCCTTT GT TAG GAGAT C CT AC T T C AGAAAAT AAG AC C GAT GAAAAA 1100 

Qy 778 AAGATAGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGACAT CA 836 

II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 
Db 1101 AAAAT AGAAGAAAAGAAGG C C CAAAT AGT AACAGAGAAGAAT AC T AGC AC CAAAAC AT CA 1160 

Qy 837 AAC C CT T T C CT T AT GG C AG C AC AGGAGT CT AAGACAGAT T AC GT T AC AACAGAT CAT GT G 896 

I I I I I I II III I I II I I I I I I I I III I I I I I I I I I I II I I I I I I I I I II I 
Db 1161 AAC C CT T T T C TT GT AGC AGC AC AG GAT T C T G AGAC AG AT TAT GT C ACAACAGAT AAT T T A 1220 

Qy 897 T CAAAG GT GAC C GAG GAAGT AGT GG CAAAC AT GC C T GAAGGT C TAAC C C CAGAT T T G GT T 956 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
Db 1221 ACAAAGGT GACT GAGGAAGT CGT GGCAAACAT GC CTGAAGGC CT GACT C CAGATTTAGT A 12 80 

Qy 957 C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGCT AC T G GT ACAAAAAT T GC CT T T GAAACA 1016 

I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 1281 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAGT T ACT G GT ACAAAGAT T GCT T AT GAAACA 1340 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 107 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I II 
Db 1341 AAAAT GGACT T GGT T CAAAC AT CAGAAGT TAT G CAAGAGT C AC T C TAT C C T GC AGC AC AG 1400 

Qy 1077 CTTTGCC CAT CTTTTGAAGAATCT GAAGCT ACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I i I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 01 CTTTGCC CAT CAT TT GAAGAGT CAGAAG CT ACT C CT T C AC C AGT T T T G C C T GAC AT T GT T 1460 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I III 
Db 1461 AT GGAAGCAC CAT T GAAT T C T G CAGT T C CT AGT G CT G GT GCT T C C GT GAT AC AGC C C AG C 1520 

Qy 1197 T CAT CAC CAT T AGAAACT CT T C C T T CAGT T AATT AT GAAAGC AT AAAGT T T GAGC CT GAA 1256 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 1521 T CAT CAC CAT T AGAAGC T T C T T CAGT T AAT TAT GAAAG C AT AAAAC AT GAG C C T GAA 1577 

Qy 12 57 AAT CCCC CAC CAT AT GAGGAGGCCATGAAT GTAT CACTAAAAAAAGAAT CAGGAAT GAAT 1316 

II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II II 

Db 1578 AAC CCCC CAC CAT AT GAAGAGG C CAT GAGT GTAT CAC T AAAAAAAGT AT CAG GAAT AAAG 1637 

Qy 1317 GAAGAAAT C ACAGAGC C T GAAG GTAT T AGT GT AG C T GTT CAGGAAAC AGAAGCT C CT T AT 1376 

I II I I I I I I I I I I I I I I I I Mill II I I I I I I I I I I I I I I II II I I I I I I I I 

Db 1638 GAAGAAAT T AAAGAGC C T GAAAAT AT T AAT GCAG C T CTT C AAGAAACAGAAG CT C C T TAT 1697 

Qy 1377 ATAT CTATT GCAT GT GAT T T AATT AAAG AAACAAAGAT CT CT ACT GAACC GACT C CAGAT 1436 

II II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III II I I I I I I I I I Ml 

Db 1698 ATAT C TAT T G CAT GT GAT T T AAT T AAAGAAACAAAGC TT T CT GCT GAAC C AGCT C C GGAT 1757 

Qy 14 37 T T C T C T AGT TAT T C AGAAAT AGC AG AAGT T G CAC AGC CAGT G C C C GAG CAT T CT GAGCT A 1496 

I I I I I I I I I I I I I I I I I I II I MINI I I I I I I I I I I II II I II I I I I I I I I I 
Db 1758 TT CT CT GAT TAT T C AGAAAT G GCAAAAGT T GAAC AGC CAGT G C CT GAT CAT T CT GAG CT A 1817 

Qy 1497 GT T GAAGAT TCCTCCCCC GAT T C T GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC C C 1556 

I I I I I I I I I I I I M II I I I II I I I I II I I I I I II I II I I I I I I I I I I I I I I I I I M I 
Db 1818 GT T GAAGAT T CC T CAC CT GAT T C T GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC CT 1877 



Qy 1557 GAAGT T C CACAAAAAC AAGAT GAAGC T GT AAT ACT T GT GAAAGAAAAC C T C AC T GAAAT T 1616 

II I I I I I I I I I I I II I I I I I I I I MM M I I I I I I I I I I I I I M M M M I I 

Db 1878 GAC GT T C CAC AAAAACAAGAT GAAAC T GT GAT G C T T GT GAAAGAAAGT C T CACT GAGAC T 1937 

Qy 1617 T CAT C T GAGT CAAT GACAGGAC AT G AC AAT AAGG GAAAACT CAGT GC T T CAC CAT CAC CT 1676 

I I I I I I I I i I I I I I I II I MM M M M I II M M M M I I M I M M I 

Db 1938 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAAC T CAGT GC T T T GC CACCT 1994 

Qy 1677 GAGGGAG GAAAAC C GT AT T T GGAGT CT T T T CAGC C C AGT T TAG G CAT C AC AAAAG AT AC C 173 6 

M I M I M I M M M I I M I I M M M Ml I I II II II I I I I I I II I I II I I 

Db 1995 GAGG GAGGAAAGC CAT AT T T GGAAT CT T T T AAGC T CAGT T T AGATAAC AC AAAAGAT AC C 2054 

Qy 1737 T T AGCAC CT GAT GAAGT T T CAGC AT T GAC C CAAAAGGAGAAAAT C C C T T T G C AGAT G GAG 17 96 

I I I II I II II I I I I I I I. M I I I I I I II I I I I I II II I I I I I I I I I II I I M I 
Db 2055 CT GT T AC C T GAT GAAGT TT CAACAT T GAGCAAAAAGGAGAAAATT C C T T T GC AGAT GGAG 2114 

Qy 1797 GAGC T CAAT ACT G CAGT T T ATT CAAGT GAT GG C T TAT T CAT T G CT C AG GAAGC AAAC C T A 1856 

II I I I I I I I I I II I I I I I II I I I I I I I I I II I I I I III II I I I I I I I I I II 

Db 2115 GAGCT CAGT ACT G CAGT T T ATT CAAAT GAT GAC T TAT T TAT T T C T AAG GAAG CAC AGAT A 2174 

Qy 18 57 AG AGAAAGT GAAAC AT T T T C AG AT T CAT C T C C GAT T GAG AT TAT AGAT GAGT T C C C GAC C 1916 

I I M I I I MUM II II I I I I I I I I I II I I I I I II I II I II I I II I I II I II M 

Db 2175 AGAGAAACT GAAAC GT T T T C AGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C CT AC A 2234 

Qy 1917 T T T GT CAGT T CTAAAGC AGAT TCTTCTCC T AC AT T AGC C AGGGAAT ACAC T GAC CT AGAA 1976 

II II I I I I M I II I Mill I I III I I I I II II I I II M I I I M I I I I II I I 

Db 2235 T T GAT C AGT T CTAAAACT GAT T CAT T T T CT AAAT T AGC C AG GGAAT AT ACT GAC C TAGAA 2294 

Qy 1977 GT AG C CC ACAAAAGT GAAAT T GCT GACAT C C AGGAT G GAGCT G G GT CAT TGGCTTGTG C A 2036 

Ml I II I I II I I II I I I I II I I I I II I II I I I II I I I I II I I I I I I I I I II 
Db 22 95 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2354 

Qy 2037 GGAT T G C C C CAT GAC CTTTCTTT C AAGAGT AT ACAAC C T AAAGAGGAAGT T CAT 2 090 

I I I I I I I II I II II II I I I II I II I I I II I II I I I II I I I I I I I 

Db 2355 GAAT T GC C C CAT GAC CT T T CT T T GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT CAGT 2414 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

M I II I I I I I I I I II I I I II III I II I I I I I I I I II I I I I I I M 
Db 2415 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2474 

Qy 2151 GAT GTTTCTGCTTTG GAT GCT CAAG C AGAG AT AGG CAG C AT AGAAAAAC C CAAAGT T CT T 2210 

II I I I I II I I I II I I I I II II I I I I I M I II I I I I I M I I II II I I I II I I I I 

Db 2475 GAT GTTTCTGCTTTGGC CACT CAAG C AGAGAT AGAGAGC AT AGT TAAAC C CAAAGT T CT T 2534 

Qy 2211 GT GAAAGAAG C C GAGAGAAAACT T C CT T C T G AT AC AGAAAAAGAGC GAAGAT CT C CAT C T 2270 

I I II II I II I I I I I I I M I II II II I I I II I II I I II II I II I I I II I I I I I I 
Db 2535 GT GAAAGAAG CT GAGAAAAAAC TTCCTTCC GAT AC AGAAAAAGAGGAC AGAT CAC CAT CT 2 594 

Qy 2271 GC T AT AT T T T CAG CAG AG C T GAGT AAAAC T T CAGT T GTT GAC CT C CT C TACT GG AGAGAC 2330 

I I I II I II I I II I II I I I II I I I I I I I I I I I I II I I I I I I I I I I II I I I I II I II II I I 
Db 2595 GC T AT AT T T T CAGC AGAG C T GAGT AAAAC T T C AGT T GTT GAC C T C CT GT ACT GGAGAG AC 2654 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I II I II I II I II I M I II I II I I I II II I I I II I I I II I II I I M II I I II I I I I 
Db 2655 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 2714 



Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I II I I I I I II I I I I I I II I I I I II I I I I I I I II I I II II I I I I I I I I I I I I I I 
Db 2715 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 2774 

Qy 2 451 T T T AGGAT AT ATAAG GGT GT GAT C C AG GCT AT C C AGAAAT CT GAT GAAGG C C AC C CAT T C 2510 

I I I I I I I I I I I II I I I I I II I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I 
Db 2 775 T T T AGGAT AT ACAAG GGT GT GAT C CAAGC T AT C C AGAAAT C AGAT GAAGG C C AC C CAT T C 2 834 

Qy 2511 AGGGCATATTT GGAATCT GAAGTT GCT AT AT CTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

I I I I I I I I I I I I I I II I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 2835 AGG G C AT AT CT GGAAT CT GAAGTT GC TAT AT CT GAG GAGT T GGT T C AGAAGT AC AGT AAT 2894 

Qy 2571 TCTGCTCTTGGT CAT GT T AAC T GCACAAT AAAAGAACT C AGAC GCCTCTTCT T AGT T GAT 2630 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 2895 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2 954 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 955 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3014 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I 11 I I I I I I I I I 
Db 3015 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3074 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

I I I I I I I I I I I I I II M I I I II I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I II I 
Db 3075 AT T TAT GAAC G GC AT C AG G C AC AGAT AGAT CAT TAT CT AGGACT T GCAAATAAGAAT GT T 3134 

Qy 2 811 AAAGAT G CT AT GGCTAAAAT C C AAGCAAAAAT C C CT GGAT T GAAGC GTAAAG CT GAAT GA 2 87 0 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I II I I I II I I I I I I I I I I I I I I I II 

Db 3135 AAAGAT GCT AT G GCTAAAAT C C AAGCAAAAAT C C CT GGATT GAAG C GCAAAGC TGAAT GA 3194 

Qy 2 871 GAAAGC CT GAAAGAGTTAACAAT AGAGGAGT T T AT CT T T AAAGG G GAT AT T CAT T T GAT T 2 930 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3195 AAAC GC C CAAAATAATT A GT AGGAGT T C AT CT T T AAAG G G GAT AT T CAT T T GAT T 324 9 

Qy 2 931 C CAT T GGGGAG GGT C AG G GAAGAACAAAGC CT T GACAT T GCAGT G C AGTT T C AC 2 98 4 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I 
Db 3250 AT AC GG GGGAG GGT C AG GGAAGAAC GAA- C CT T GAC GT T GCAGT G C AGTT T C ACAGAT C G 3308 

Qy 2 985 AGAT CT TT AT TT T T AGCAAC G CAGT G- T C T GAGGAAAAAT GAC CT GT C T T GACT G 3038 

I I I I I I I I I I I I I I I I I I III II I MINIMI M I I I I I I I I M II 
Db 3309 T T GT T AGAT CT TT AT TT T TAG C CAT G C ACT GT T GT GAGGAAAAAT T AC CT GT CT T GAC T G 3368 

Qy 3039 C C CT GT GT TC AT CAT CT TAAGT AT T GT AAG CT G C TAT GT AT GGAT T T AAAT C GT AAT CAT 3098 

II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I II II 

Db 3369 C CAT GT GT T CAT CAT C T TAAGT AT T GT AAG C T G C TAT GT AT G GAT T T AAAC C GT AAT CAT 3428 

Qy 309 9 AT TTGTTTTTCCT GT AT GAGG C AC T G GT GAAT AAACAAAGAT C T GAGAAAG CT GT AT AT T 3158 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 342 9 A — TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAAC CT GT AT ATT 3475 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I M 
Db 3476 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGA 3535 



Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATG 3271 



I I III I I I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3536 AAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATG 3595 

Qy 3272 C AG AT T T T C T GAAAT GAAA T GT T T AGAC GAGAT CAT G C C AC CAAG G C AG GAGT GAA 3327 

I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I II II I I I I I I III 

Db 3596 C AGAT T T T CT GAAAT GAAAT GT T T GT T TAG AC GAGAT CAT AC C GGTAAAG CAG GAAT G AC 3655 

Qy 3328 AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG 3387 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3656 AAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTG 3714 

Qy 33 8 8 CCAAT ATAAGTAAAT ATAGATTATATATAT CT AT ATAT AGT GT TT CACGAAGCTT AGCCC 3447 

I I I I II I I I II I I I II I I I I I I I I I III I I I II I I I I I I I I I I I I I II I II 

Db 3715 C C AAT AT AAGT AAAT AT AG AT TATA TAT GTATAGT GTTTCACAAAGCTT AGAC C 37 68 

Qy 3448 T TT AC CT T C C CAG CT GC C C CACAGT GCTT GAT ACT TCTGTCATGGGTTTTATGT 3501 

I I I I I I I I I I I I I I II I I I I I I I I I I I I II I II I II I I I I I I I I I 

Db 3769 T TT AC CT T - C C AGCC AC C C CACAGT GCTT GAT AT T T C AGAGT CAGT CAT T GGT TAT AC AT 3827 

Qy 3502 GTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGT 3561 

I I I I I II I I I I I I I I I I I II I I I I I I I II I I I I I II I I I I I I I I I II I II 

Db 3828 GTGTAGTT C CAAAGCACATAAGCT AGAAGAAGAAATATTTCT AGGAGCACTAC CAT CT GT 3887 

Qy 3562 TTT CAACAC GAAC CGAC GCCAT GCAAACAGAACT CC - T CAACATAAACTT CACT GCACAG 3620 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I 
Db 3888 TTT C AAC AT GAA AT G C C AC AC AC AT AGAACT C C AAC AAC AT C AAT T T CAT T G C AC AG 3944 

Qy 3621 ACT T AC T GT AGT T AAT T TT AT C AC — AAACTCTGGACT GAAT CT AAT GCTTCCAAAAA — 3676 

III I I I I I II I I I I I I II I I I I II II I I I I I I II I I II I I I I I I I I I I I I I I 
Db 3945 ACT GAC T GT AGT T AAT T T T GT C AC AGAAT CT AT GGAC T GAAT CT AAT GCT T C CAAAAAT G 4004 

Qy 3677 T GTTT GCAAATAT CAAACATT GTTAT GTAAGAAAATAT AAAT GAC G A 3723 

I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I III I I I I I I II 

Db 4005 TTGTTTGTTTG C AAAT AT C AAAC AT T GT TAT G CAAG AAAT TAT T AAT T AC AAAAT G AAG A 4064 

Qy 3724 T TT AT ACAAT T GT GGT T T AAGC T GT AT T GAAC T AAAT CT GT G GAAT GC AT T GT GAACT GT 37 83 

I II I II I I I I I I I I I I I I II I I II I I I I II I I I I I I I I I II I I I I I I I I I I M I I I I 1 
Db 4065 T TT AT AC CAT T GT GGT T T AAGC T GTACT GAAC TAAAT CT GT G GAAT GC AT T GT GAAC T GT 4124 

Qy 3784 AAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAA 3819 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4125 AAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAA 4160 
REFERENCE 3 (bases 1 to 4789) 

AUTHORS Van der Putten,H. 

TITLE Direct Submission 

JOURNAL Submitted (07-MAY-2002) Nervous System Research, Novartis Pharma 
Inc., Basel, Switzerland 
FEATURES Location/Qualifiers 
source 1..4789 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/chromosome="2" 

/map="2p16" 
gene 1..4789 

/gene="RTN4" 

/note="synonym: NOGO" 
5'UTR 1..244 

/gene="RTN4" 

/evidence=experimental 
CDS 245..3823 

/gene="RTN4" 

/note="NOGO-A; RTN4-A; alternatively spliced" 
/codon_start=1 
/product="RTN4 isoform A" 
/protein_id="AAM64248.1" 
/db_xref="GI:26800573" 

/translation="MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED 
EDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP 
ERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWT 
PPAPAPAAPPSTPAAPKRRGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTIS 
AGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV 
SEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKL 
VSNNILHNQQELPTALTKLVKEDEVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFER 
VWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPST 
PEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEK 
NTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEV 
TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV 
PSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPEN 
INAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSP 
DSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKP 
YLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET 
ETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 
LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKV 
LVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLS 
LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELV 
QKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 
SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE" 
3'UTR 3824..4789 

/gene="RTN4" 

ORIGIN 

Query Match 70.9%; Score 2716; DB 9; Length 4789; 

Best Local Similarity 86.0%; Pred. No. 0; 

Matches 3332; Conservative 0; Mismatches 445; Indels 99; Gaps 25; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I I I I I I I I I II i I I I I I I I I I I I I I I I I I I I I I II II 

Db 956 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 1015 

Qy 61 GT AC T GC C CAC T GAAGGAAC AC T T C C AG CAAC T T CAAAT GAAGCT T CTAAAGCAT T CT C A 120 

III I I M II I I I I I I I I I I I I II II II I I I I I I I I I I I II I I I I I I I I 

Db 1016 GT AT T AC C CAC T GAAGGAAC AC T T CAAGAAAAT GT CAGT GAAGCT T CT AAAGAG GT CT C A 107 5 

Qy 121 GAGAAGGCAAAAAAT CCATTT GT AGAGAGAAATT TAACAGAATTT T C AGAATT GGAATAT 180 

I I I I II I I I I I I I II I I I I I I III I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 1076 GAGAAGGCAAAAACT CT ACT CAT AGAT AGAGATT TAACAGAGTTT T CAGAATTAGAATAC 1135 

Qy 181 T CAGAAAT GGAAT CAT C ATT CAGT GG CT CT CAAAAGG C AGAAC CT GC C GTAAC AGTAG C G 2 40 

I I I I I I Ml I I I I I 1 I I I I I I I I I I I I I III I I I I I I I I II I I I I I I I I I I I 
Db 1136 T CAGAAAT GG GAT CAT C GT T CAGT GT CT CT C CAAAAG CAGAAT CT GC C GTAATAGTAG C A 1195 

Qy 2 41 AAT C CT AG GGAC GAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAGGACT T AGT T AGT CT T 300 

I I I I I I I I I I I I I I I I I 1 1 I I I I II I I II II I II I I I 1 I I I I I I I I I I 

Db 1196 AAT C CT AG GGAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AAT 1255 

Qy 301 AACAT CCTT CATACT CAGCAGGAGTTAT CT ACAGT CCTTAC GAAAT C AGTT GAAGAAGAA 360 

II I I I I I I I I I I I III II I I I I I I I I I I I I II I I I I I I I III I I I I II 

Db 1256 AAC AT CCT T CAT AAT C AACAAG AGT T AC CT ACAGCT CTT AC TAAAT T G GT T AAAGAGGAT 1315 

Qy 361 GAT AGAGT TCTGTCTC CAGAAAAAACAAAGGAC AGT TT T AAG GAAAAG GGAGTT G CAGC A 420 

II I I I I Mill I I II I I I I II I I I I I II I II I M I I I I M I II I I I I II 
Db 1316 GA AGT T GT GT CTT C AGAAAAAG CAAAAGAC AGT TT TAAT GAAAAGAGAGTT G CAGT G 1372 

Qy 421 GAAGC TT C TAT G G GGGAG GAAT AT G C AGACT T CAAAC CAT T T GAGC G AGT AT GGGAAGT G 4 80 

II II I I II I I I II II I II II I I I I M II I I I I M I I I I II II I II I I I I I I II II I II 
Db 1373 GAAGCT C C TAT GAGGGAG GAAT AT G C AGACT T CAAAC CAT T T GAG C GAGT AT G GGAAGT G 1432 

Qy 481 AAAGAT AC T T ACAAGCAAGAT AGT GAT GT T T T GAT T G C T G GAGGT AAT AT AGAGAGCAAA 540 

M II II I I III II M II I I I I I I Ml I I I I I II I I I I I II I I M II I I 
Db 1433 AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 14 89 

Qy 541 T T GGAAG GTAAAGT G GAT AAGAAAC ACT T T T C AGAT AG CCTT GAACAAAC AAAT C GT GAA 600 

MUM I I II II I II II I I III III M I II I I I I I I I I I I I M I I I I III 
Db 1490 T T GGAAAGTAAAGT G GAT AAAAAAT GT T T T GC AGATAGC CTT GAG CAAACT AAT CAC GAA 1549 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GAC AC T T CAT T T C C CAGT AC AC C AGAAG C T GT AAG A 660 

I II I I I I II I I II M I I I II I I M I I I I I II I I II I I I I I I II I I I I II I 
Db 1550 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT T C T T T C C C CAGT AC G C C AGAAG GT AT AAAG 1609 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACT GAGAAT GTT 717 

I I MM Mill II Mill I I I I II II I I M II M M I I I I II I I I I I 

Db 1610 GAT C GT T CAG GAG CAT AT AT CAC AT GTGCTCCCTT T AAC C C AG C AGCAACT GAGAG CAT T 1669 



Qy 718 T C AACAAACAT TTTTCCCTT GT T G GAAG AT CAT ACT T C G GAAAAT AAGAC AGAT GAAAAA 77 7 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1670 G C AACAAACAT TTTTCCTTT GT TAG GAG AT C CT ACT T C AG AAAAT AAGAC C GAT GAAAAA 1729 

Qy 77 8 AAGAT AGAA- AAAAAAAGGCAC AAAT T GT AACAGAGAAGAAT GCAAGT GT CAAGACAT C A 836 

II I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I II Ml I I I I I I 
Db 1730 AAAAT AGAAGAAAAGAAGGC C CAAATAGT AACAGAGAAGAAT ACT AG C AC CAAAACAT C A 17 8 9 

Qy 837 AAC CCTTTCCT TAT G G C AG C AC AGG AGT CT AAGAC AGAT T AC GT TACAAC AGAT CAT GT G 8 96 

I I I I I I I I III I I I I I I I I I I I I II I I I I I I I I II I II I I I I I I I I I II I 
Db 17 90 AAC CCTTTTCTT GT AG C AG C AC AGG AT T CT GAGAC AGAT TAT GT C ACAAC AGAT AAT T T A 184 9 

Qy 897 T C AAAG GT GAC C GAG GAAGT AGT GG CAAAC AT G C C T GAAGGT CTAAC CC C AGAT T T GGT T 956 

I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
Db 1850 AC AAAG GT GAC T GAG GAAGT C GT GG CAAAC AT G C C T GAAGG C CT GACT C C AGAT T T AGT A 1909 

Qy 957 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGCTACT GGTACAAAAATTGC CTTT GAAACA 1016 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1910 C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGT TAC T GGT ACAAAGAT T GC T TAT GAAACA 1969 

Qy 1017 AAAAT GGAC C T GGT T CAAAC T T CAGAAG CT GT G C AG GAGT C ACT T T ACC C T GT AACAC AG 1076 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I II I I 
Db 197 0 AAAAT GGACT T GGT T CAAAC AT C AGAAGT T AT G CAAGAGT C ACT C TAT C CT GC AG CAC AG 2 029 

Qy 107 7 CTTT GCC CAT CTTTTGAAGAATCT GAAGCTACT CCGTCACCGGTTTTGCCTGACATTGTC 113 6 

I I I I I I II I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 2 030 CT T T GC C CAT CAT T T GAAGAGT C AGAAG CT ACT C CT T CAC C AGT T T T GC CT GAC AT T GT T 2 08 9 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 2090 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2149 

Qy 1197 T CAT CAC CAT T AGAAACT C T T C C T T C AGT T AAT TAT G AAAG C AT AAAGT T T GAGC C T GAA 125 6 

I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I 

Db 2150 T CAT CAC CATT AGAAGC T T C T T C AGT T AAT TAT GAAAG CAT AAAAC AT GAGC C T GAA 220 6 

Qy 1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I M 

Db 2207 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2266 

Qy 1317 GAAGAAAT CAC AGAG C CT GAAG GT AT T AGT GT AG CT GTT C AG GAAAC AGAAGCT C C T TAT 137 6 

I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I II I 

Db 22 67 GAAGAAAT TAAAGAGC CT GAAAAT ATTAAT GCAGCT CTT C AAGAAAC AGAAGCT CCT TAT 2 32 6 

Qy 1377 AT AT CT AT T GCAT GT GAT T T AAT T AAAGAAACAAAGAT C T CT ACT GAAC C GACT C C AGAT 1436 

II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I III I M I I I I I I I I I M 

Db 2327 AT AT CT AT T GCAT GT GAT T TAAT T AAAGAAACAAAG CTT T CT GCT GAAC C AGCT C C GGAT 238 6 

Qy 1437 T T C T C T AGT TAT T C AG AAAT AG C AG AAGT T G CAC AG C C AGT G C C C GAG CAT T C T GAG C T A 14 96 

I I I I I I I I I I I II I II I I III I I I I I I I I I II I I I I II I II I I I I I I I I I I I I 
Db 2387 TT C T CT G AT TAT T CAGAAAT GGCAAAAGT T GAAC AG C C AGT G CCT GAT CAT T CT GAGCT A 244 6 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T C T GAAC C AGT T GACT TAT T T AGT GAT GAT T CAAT AC C C 1556 

I I I I I I I I II I I I I II II II I I I I I I II I I I I I I I I I I I II I I II I I I I I I I M I I I 
Db 2447 GT T GAAGAT T CC T CAC CT GAT T CT GAAC C AGT T GACT TAT T T AGT GAT GAT T CAAT AC C T 2506 

Qy 1557 GAAGT T C CACAAAAACAAGAT GAAGCT GTAAT ACT T GTGAAAGAAAACCT CACT GAAATT 1616 



II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 

Db 2507 GAC GT T C CACAAAAACAAGAT GAAACT GT GAT GCTT GT GAAAGAAAGT CT CACT GAGACT 2566 

Qy 1617 T CAT CT GAGT CAAT GAC AGGACAT GAC AAT AAG G GAAAAC T C AGT GCTT C AC CAT CAC CT 167 6 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 2567 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAAC T C AGT G C T T T G C CACCT 2623 

Qy 1677 GAGG GAGGAAAAC C GT AT T T GGAGT C T T T T CAGC CC AGT T T AGGC AT C AC AAAAG AT AC C 1736 

I I I I I I I I I I I II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 
Db 2624 GAGGGAGGAAAG C CAT AT T T GGAAT C T T T T AAGCT C AGT T T AGAT AAC AC AAAAG AT AC C 2683 

Qy 1737 T TAG CAC CT GAT GAAGT T T CAGC AT T GAC C CAAAAG GAGAAAAT C C CT T T G C AGAT G GAG 1796 

I I I I I I I I I I I I I I I II Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 68 4 CT GT T AC CT GAT GAAGT T T C AAC AT T GAG C AAAAAGGAGAAAAT T C CT TT G C AGAT G GAG 27 4 3 

Qy 17 97 GAG CT CAAT ACT GC AGT T TAT T CAAGT GAT GGC T TAT T CAT T GCT CAG GAAG CAAAC C T A 1856 

I I I I I I I I I I I I I I I I I I I I II I I Mill II I I I I III II I I I I I I I I I II 

Db 2744 GAGCT CAGT AC T GC AGT T TAT T CAAAT GAT GAC T TAT T T AT TT C T AAGGAAG CAC AGAT A 2 8 03 

Qy 1857 AG AG AAAGT G AAAC ATT T T C AGAT T CAT C T C C GAT T GAGAT TAT AG AT GAGT T C C C GAC C 1916 

II I I I I I I I I I M II I I I I I I II I I I I I II I I I I I I I I I I II I I I I I I I I I I II 

Db 2 804 AGAGAAACT GAAAC GTT T T C AGAT T CAT C T C CAATT GAAAT TAT AGAT GAGT T CC CT AC A 2 863 

Qy 1917 T TT GT CAGTT CTAAAGCAGATT CT T CT CCT AC ATTAGCCAGGGAATACACT GACCTAGAA 1976 

II I I I I I I I II I I I Mill I I III I I I II I II I I I I I I I I I I I I I I I II I I 

Db 28 64 T T GAT CAGTT CTAAAAC T GAT T CAT T T T C T AAAT TAG C CAG GGAAT AT ACT GACCTAGAA 2923 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

III I I I I I I I I I I I I I I I II I I I I II I I I I 1 I II I I I I II I I II I I I I I II 

Db 2924 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2983 

Qy 2037 G GAT T GC C CCAT GAC CT T T CT T T C AAGAGT AT ACAAC C TAAAGAG GAAGT T CAT 2090 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I 

Db 2984 GAAT T GC C CCAT GAC CT T T CT TT GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT CAGT 3043 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I II I I II II I I I I I I I I I Ml II I I I I I I II I I I M I I I I I I 
Db 3 04 4 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 3103 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAGC AGAGAT AG GCAGCAT AGAAAAAC C CAAAGT T CT T 2210 

I I I I I I I I I I II I I I I I I I I II I I II II I I I I I I I I I I II I I II II I I I I II I 
Db 3104 GAT GTTTCTGCTTTG GC CAC T CAAGC AGAGAT AGAGAGC AT AGT T AAAC C CAAAGT T CT T 3163 

Qy 2211 GT GAAAGAAG C C GAGAGAAAACT T C CT T CT GAT AC AGAAAAAGAG C GAAGAT C T C CAT CT 2270 

I I I I I I I M I I II I I I I II I II I II I I II I I I I I I I I I I I II Mill I I I I I I 
Db 3164 GT GAAAGAAGCT GAGAAAAAACT T C CT T C C GAT AC AGAAAAAGAG GAC AGAT CAC CAT CT 3223 

Qy 2271 GCT AT AT T T T CAG C AGAG CT GAGT AAAACT T C AGTT GT T GACC T CCT CT ACT G GAGAGAC 2330 

I M II II I I I II I II I I II I II I I II I I I I I I II I I I I I II I I I II I I I I I I I I I I I I I 
Db 3224 G CT AT AT T TT CAG C AGAG CT GAGT AAAACT T CAGTT GTT GACCT CCT GT ACT G GAGAGAC 3283 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I II II I II I I I I I I I II I II II II I I I I I I I II I II I II I I I I M II 
Db 3284 ATTAAGAAGACT GGAGT GGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3343 



Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I II I I II II I I I I I I I I II I II I I I I I II I I II I I I II I II I I I II I I I I I I M I I I 

Db 3344 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3403 



Qy 2451 T T T AGGAT AT AT AAG G GT GT GAT C CAGG C TAT C C AGAAAT C T GAT GAAG G C CAC C CAT T C 2510 

I I I I I I II I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34 04 T T T AGGAT AT AC AAG G GT GT GAT C C AAG C TAT C C AGAAAT CAGAT GAAG G C CAC C CAT T C 34 63 

Qy 2511 AGG G CAT AT T T G GAAT CT GAAGT T G CT AT AT CT GAG GAGT T GGT T C AGAAGT AC AG C AAT 257 0 

I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
Db 34 64 AGGGCATAT CT GGAAT CT GAAGT T GCT AT AT CT GAG GAGT T GGTT C AGAAGT AC AG T AAT 3523 

Qy 2571 TCTGCTCTTGGT CAT GTT AACT G C ACAAT AAAAGAAC T CAGAC GC CT C T T C TT AGT T GAT 2 630 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3524 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3583 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2 690 

I I I I I I I I I M I I I I I I I I II I I I I I I I I I II I I I I I II I I I I I I I II I I II I I I I I I I I 
Db 3584 GAT T T AGT T GATT CT CT GAAGT T T G CAGT GTT GAT GT G GGT AT TT AC CT AT GT T G GT G C C 3643 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3644 TTGTTTAATGGTCTGACACTACT GATT TTGGCTCTCATTTCACTCTT CAGT GTT CCTGTT 3703 

Qy 2751 AT T TAT GAAC G GC AT C AGGC GCAAAT AGAT CAT TAT CT GGGAC TT GCAAAT AAGAAT GT T 2810 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 37 04 AT T TAT GAAC GGC AT C AGGC ACAGAT AGAT CAT TAT CTAGGAC TT GCAAAT AAGAAT GT T 37 63 

Qy 2811 AAAGAT GC T AT GGC T AAAAT C CAAGCAAAAAT C C C T G GAT T GAAGC GT AAAGCT GAAT GA 2 87 0 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 37 64 AAAGAT GCT AT GGC T AAAAT C CAAGCAAAAAT C C C T G GAT T GAAG C GC AAAGC T GAAT GA 3 82 3 

Qy 2871 GAAAGC CT GAAAGAGT TAACAATAGAGGAGTTT AT CTTTAAAGGGGAT ATT CATTT GATT 2 93 0 

I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 3824 AAAC GC C C AAAAT AAT T A GTAGGAGTT CAT CTTTAAAGGGGAT ATT CATTT GATT 3878 

Qy 2931 C CAT T G GGGAG GGT C AGGGAAGAACAAAGC CT T GACAT T G CAGT G C AGT TT CAC 2984 

I I I I I II I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I 
Db 3879 AT AC GGGGGAGGGT C AGGGAAGAAC GAA- C CT T GAC GT T GC AGT GC AGT T T C ACAGAT C G 3937 

Qy 2985 AGATCTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTG 3038 

I I I I I I I I I I I I I I I I I I II I II I I I I 1 I I I I I I I I I II I I II I I I II I 
Db 3938 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3997 

Qy 3039 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 3098 

II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3998 C CAT GT GT T CAT CAT C T T AAGT AT T GT AAGCT GC T AT GT AT G GAT T T AAAC CGTAAT CAT 4057 

Qy 3099 AT TTGTTTTTCCT GT AT GAG GCACT GGT GAAT AAAC AAAGAT C T GAGAAAG CT GT AT AT T 3158 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 058 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC CT GT AT AT T 4104 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I I I I I I I I I I I II I I I I I I I II I I II I I I I I I III I I I I I I I I 
Db 4105 T T ACTT T GT T G CAGAT AGT CT T GC C GC AT CTT GG CAAGT T G C AGAGAT GGT GGAG CT AGA 4164 

Qy 3215 GAC AGAAAT AAC C C T T T T CAC AGT T T GT G CAC T GT GT AC GGT CT GT GT AG GT T GAT G 3271 

I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4165 AAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATG 4224 



Qy 3272 C AG AT T T T C T GAAAT G AAA T G T T TAG AC GAG AT C AT G C C AC C AAG G C AG GAG T GAA 3327 

I I I I I I I I II I I I I I I I I I I I I I II I I I I I II I I I I II II I I I I I I III 

Db 4225 C AGAT T T T C T GAAAT GAAAT GT T T GT T TAG AC GAG AT CAT AC C GGT AAAGC AGGAAT G AC 42 84 

Qy 3328 AAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTG 3387 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 42 85 AAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTG 43 43 

Qy 3388 C CAAT AT AAGT AAAT AT AGATT AT AT AT AT C T AT AT AT AGT GT T T C AC GAAG C T T AGC C C 3447 

I I II I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I II II I I I I I I II 

Db 4344 C CAATAT AAGTAAATAT AGATT AT A TAT GT AT AGT GT T T C AC AAAG CT T AGAC C 4397 

Qy 3448 TTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT T CT GT CAT GGGTTTT AT GT 3501 

I III I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 439 8 TTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTATACAT 44 56 

Qy 3502 GTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGT 35 61 

I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 4457 GT GT AGTT CCAAAGCACATAAGCTAGAAGAAGAAATATTT CTAGGAGCACT AC CAT CT GT 4516 

Qy 3562 T T T CAACAC GAAC C GAC G C CAT GCAAACAGAACT C C - T CAAC AT AAACT T C ACT GCAC AG 3620 

II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II 

Db 4517 T T T CAAC AT GAA AT GC CACACACATAGAACT CCAACAACAT CAATTT CATT GCACAG 4573 

Qy 3621 ACT TACT GT AGT T AAT T T TAT C AC — AAACT C T G GAC T GAAT CTAAT GCT T C CAAAAA — 3676 

III I I I M I I I I M I I I I I I I I II II I I I I I I I I I I I I I II I II I I I I I I II 

Db 4574 ACT GACT GT AGT T AATT T T GT C ACAGAAT CT AT GGACT GAAT CTAAT GCT T C CAAAAAT G 4633 

Qy 3677 T GT T T GC AAAT AT C AAAC AT T GT TAT GT AAGAAAAT AT AAAT GAC G A 3723 

I M I I I I I I I I I I I II I I I I I I I I I I I I I I I II III I I I I I I II 

Db 4634 T T GT T T GT T T GCAAAT AT CAAACAT T GT TAT G CAAGAAAT TAT TAAT T ACAAAAT GAAGA 4 693 

Qy 3724 T T T AT ACAAT T GT GGT T TAAGCT GT AT T GAAC TAAAT C T GT GGAAT GC AT T GT GAACT GT 37 83 

I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M 
Db 4 694 T T TAT AC CAT T GT GGT T T AAGC T GT ACT GAAC TAAAT CT GT GGAAT GC AT T GT GAACT GT 4753 

Qy 37 84 AAAAGCAAAGT AT CAAT AAAGC T T AT AGACT T AAAA 3819 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 4754 AAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAA 4789 
ORIGIN 

Query Match 70.1%; Score 2687.8; DB 6; Length 4822; 

Best Local Similarity 85.6%; Pred. No. 0; 

Matches 3333; Conservative 0; Mismatches 458; Indels 104; Gaps 27; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 964 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 1023 

Qy 61 GT ACT GC C CAC T GAAG GAAC ACT T C CAGC AAC TT CAAAT GAAGC T T CT AAAG CAT T CT C A 120 

III I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I II I 

Db 1024 GT AT T AC C CACT GAAG GAAC ACT T C AAGAAAAT GT CAGT GAAGC T T CT AAAG AGGT CT C A 1083 

Qy 121 GAGAAGGCAAAAAAT C CATTT GT AGAGAGAAATTTAACAGAATT TT CAGAATT GGAAT AT 180 

I I I I I II I I I I I I II I I I I I I III I II I I I I I I I I I II I I I I I I I I I I I I 
Db 1084 GAGAAGGCAAAAACT CTACTCATAGAT AGAGATTTAACAGAGT TTT CAGAATT AGAAT AC 1143 

Qy 181 T C AGAAAT GGAAT CAT CAT T CAGT G GCT C T CAAAAGGC AGAAC CT G C C GT AAC AGT AGC G 24 0 

I I I I I II I I I I I I I I I I I I II I I I I I I I III II II I I I I I I I I I I I I I I I I I 
Db 1144 T C AGAAAT GGGAT CAT C GT T CAGT GT CT C T C CAAAAGC AGAAT CT G C C GT AAT AGTAGC A 1203 

Qy 241 AAT C C TAGGGAC GAAAT AGT T GT GAGGAGTAGAGAT AAAGAAGAGGACT T AGTT AGT C T T 300 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I M I I I I I I I I 
Db 1204 AAT C C TAGGGAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGTT AGT AAT 1263 

Qy 301 AAC AT C CT T CAT ACT CAGCAGGAGT TAT CT AC AGT C CT T AC GAAAT CAGT T GAAG AAGAA 360 

I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I III I I I I II 

Db 12 64 AAC AT C CT T CAT AAT CAACAAGAGT T AC CT AC AGCT CT T ACTAAAT T GGT T AAAG AGGAT 132 3 

Qy 361 GAT AGAGT T CT GT CT C CAGAAAAAACAAAGGACAGT T T TAAG GAAAAGG GAGTT GCAGC A 420 

II I I I I I I I I I I I I I I I II MM I I I I II II I I I II I I I I M I M M I I 

Db 1324 GA AGTT GT GT CTT CAGAAAAAGCAAAAGACAGTTTTAAT GAAAAGAGAGTT GCAGT G 1380 

Qy 421 GAAGCTT CT AT G GGGGAGGAAT AT G CAGAC T T CAAAC CAT T T GAGC GAGT AT GGGAAGT G 480 

II I I I I I II I I I I I I I I I I I I I I II II II I I I I I I I I I I I II I II I II I I M I I I M I 
Db 1381 GAAGCT C CT AT GAG GGAGGAAT AT G CAGAC T T CAAAC CAT T T GAGC GAGT AT GGGAAGT G 144 0 

Qy 4 81 AAAGAT AC T T ACAAGCAAGAT AGT GAT GT TTT GAT T GCT G GAGGT AAT AT AGAGAGC AAA 540 

I II I I II I III I I I I II II I I I Mill I I I I I I I I I I I I II I I I I I I I I 
Db 1441 AAAGATAGT AAGGAAGATAGT GAT AT GTT GGCT GCT GGAGGTAAAAT CGAGAGCAAC 1497 

Qy 541 T T GGAAG GTAAAGT GGAT AAGAAAC ACT T T T C AGAT AGC CTT GAACAAACAAAT C GT GAA 600 

I I I I I I I I I I II I I I I I I I III Ml I I I I I I I II I I I I I I I I I I I I I M I 

Db 14 98 TT GGAAAGT AAAGT G GATAAAAAAT GT T T T G CAGAT AGC C T T GAGCAAACT AAT CAC GAA 1557 

Qy 601 AAAGATAGT GAAAGCAGTAAT GAT GACACTT CATTT CCCAGTACACCAGAAGCT GTAAGA 660 

II I I I I M I I I II I I I I I I M I I I Mill II I I I II I I I I I I I I I I I IN 

Db 1558 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT T CTTT CC C CAGT AC GC C AGAAG GT AT AAAG 1617 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I II I I Mill II I I I I I I I I I I I M I I I I II I I II I I II I I I I I M 
Db 1618 GAT C GTT C AG GAGC AT AT AT C ACAT GTGCTCCCTT T AAC C CAGC AG CAACT GAGAGC AT T 1677 



Qy 718 T C AAC AAAC AT TTTTCCCTT GT T GGAAG AT CAT ACT T C GGAAAAT AAGAC AGAT GAAAAA 777 

II I II I I I I II II I I I I I I I I I Mill MINI I II II I I I I II I I I I I I II I 
Db 1678 G CAACAAAC AT TTTTCCTTT GT TAGGAG AT C CT AC T T CAGAAAAT AAGAC C GAT GAAAAA 1737 

Qy 778 AAGA- - T AGAAAAAAAAAG GC AC AAAT T GT AAC AGAGAAGAAT G C AAGT GT CAAGACAT C 835 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I 

Db 1738 AAAAAT AGAAGAAAAGAAG GC C CAAAT AGT AACAGAGAAGAAT AC T AGCAC CAAAACAT C 1797 

Qy 836 AAAC C C - T T T C CT T AT G GCAGC AC AG GAGT C T AAGAC AGAT T AC GT T ACAACAGAT CAT G 894 

I I I I I I III III I I I I I I I I I I I I III I I I I I I I I I I II I I I I I I I I I II 
Db 17 98 AAAC CCT T T T ACT T GT AGCAGC AC AG GAT T C T GAGAC AGAT TAT GT C AC AAC AGAT AAT T 18 57 

Qy 895 T GT C AAAGGT GAC CGAG GAAGT AGT G GC AAAC AT G C C T GAAGGT CT AAC C C C AG AT TT G G 954 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II MINIMI 
Db 1858 T AAC AAAGGT GACT GAG GAAGT C GT G GCAAAC AT G C C T GAAG G C CT GAC T C C AGAT TT AG 1917 

Qy 955 T T C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAG CT AC T GGT ACAAAAAT T G CC T T T GAAA 1014 

I I I I I I I I II I I I I I II I I I II II I II I I I I I I I I II I II I II I I II I I I Mill 
Db 1918 T AC AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGT T AC T GGT ACAAAGAT T G CT TAT GAAA 1977 

Qy 1015 C AAAAAT GGAC CT GGT T CAAAC T T C AGAAGC T GT GCAGGAGT CAC T T T AC CCT GT AAC AC 1074 

M M I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I II I I I I I III 
Db 1978 CAAAAAT GGACTT GGTT CAAACAT C AGAAGT TAT GCAAGAGT CACT CT AT C CT GCAGCAC 2037 

Qy 1075 AGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTG 1134 

I I I I I II I I II I I I I I I II I I II II I I I I I I I II Mill I M I I I I I I I I I I I I I 
Db 2038 AGCT TT GC C CAT CAT T T GAAGAGT C AGAAGC TACT C CT T CAC CAGT T TT G CCT GACAT T G 2 097 

Qy 1135 TCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCA 1194 

I I I I I I I II I I I I I I I II II M II I I I I I I I I II I I M I I I I I I I I I I I II 
Db 2098 TTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCA 2157 

Qy 1195 GT T CAT CAC CAT T AGAAAC TCTTCCTT CAGT T AAT TAT GAAAGC AT AAAGT T T GAGCCT G 1254 

I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I 

Db 2158 GCTCATCACCATTAGAAGC TT CT T CAGT TAAT TAT GAAAGC AT AAAAC AT GAG C CT G 2214 

Qy 1255 AAAAT C C C C CAC CAT AT GAG GAGGC CAT GAAT GT AT C ACT AAAAAAAGAAT C AGGAAT G A 1314 

II II II I I I M I II I I II I I I I I I I I I I I I II I II I M I I I I II I I I I I I I I II I 

Db 2215 AAAAC C C C C CAC CAT AT GAAGAGGC CAT GAGT GT AT C ACT AAAAAAAGT AT C AGGAAT AA 2274 

Qy 1315 AT GAAGAAAT CAC AGAG CCT GAAGGT AT T AGT GT AG C T GT T CAGGAAAC AGAAGCT C CT T 1374 

I I I I I I I M I I I I I I I I I I I I I I I I II I I I I II II I I I I II M II I I I I II 

Db 2275 AG GAAGAAAT T AAAGAG CCT GAAAAT AT TAAT G C AG C T CT T CAAGAAAC AGAAGCT C CT T 2 334 

Qy 1375 AT AT AT C TAT T GCAT GT GAT T TAATT AAAGAAACAAAGAT C T CT AC T GAAC C GACT C C AG 1434 

II I I I I I I II II I II I I M I M II I I M I I I I II I I I I I III I I I I I I I I I I I I 

Db 2335 AT AT AT CT AT T GCAT GT GAT T TAAT T AAAGAAACAAAGCT T T CT GC T GAAC CAG CT C C GG 2394 

Qy 14 35 AT T T CT CT AGT TAT T CAGAAATAGC AGAAGT T G CAC AG CC AGT GC C C GAG CAT T CT GAGC 14 94 

I I I I II I I I I II I I I M II I III I I I I I I I I I I I I I I I II I II I I I I I I I I II 

Db 2395 AT T T CT CT GAT TAT T CAGAAAT GGCAAAAGT T GAACAG CCAGT GC CT GAT CAT T C T GAG C 2454 

Qy 1495 T AGT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GAC T TAT T T AGT GAT GAT T CAAT AC 1554 

II II I II I II M I II I II I II I II II I I I II I I II I I I I I I II I I II II I II I II I II 

Db 2455 TAGT T GAAGATTCCT CACCT GAT T CT GAACC AGTT GACTTATTTAGT GAT GAT T CAAT AC 2 514 



Qy 1555 CCGAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAA 1614 

I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II I I 

Db 2515 CTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGA 2574 

Qy 1615 T T T CAT C T GAGT C AAT GAC AG GAC AT GAC AAT AAG GGAAAAC T C AGT GCTT C AC CAT C AC 1674 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 2575 CT T CATT T GAGT CAAT GAT AGAAT AT GAAAAT AAG GAAAAAC T CAGT GCTTTGC - — C AC 2 631 

Qy 1675 CT GAG GGAGGAAAAC C GT AT TT GGAGT CT T TT C AG C C C AGT T T AGGC AT C ACAAAAG AT A 1734 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I III I I I I I M I I I I I I I I I I I I I 

Db 2632 CT GAGGGAGGAAAGC CATAT TT GGAAT CTTTTAAGCT CAGTTTAGATAACACAAAAGATA 2691 

Qy 1735 CC T T AGC AC CT GAT GAAGT T T CAGCAT T GAC C CAAAAGGAGAAAAT C C C T T T G C AGAT G G 1794 

II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I 

Db 2692 CC CT GTTAC CTGAT GAAGTTT CAACATT GAGCAAAAAGGAGAAAATT CCTTT GCAGAT GG 2751 

Qy 1795 AG GAG CT CAAT ACT G CAGT T TAT T CAAGT GAT GGC T TAT T CAT T G CT CAGGAAGCAAAC C 1854 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I III II I I I I I I I I I 
Db 2752 AG GAG CT CAGT ACT G CAGT T TAT T CAAAT GAT GAC T TAT T TAT T T CTAAGGAAG C AC AGA 2811 

Qy 1855 TAAGAGAAAGT GAAAC AT TT T C AGATT C AT CT C C GAT T G AGAT TAT AGAT GAGT T C C C GA 1914 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 2812 TAAGAGAAACT GAAAC GT TT T C AGATT CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C C T A 2871 

Qy 1915 CCTTT GT C AGTT CT AAAG C AGAT T CTT CT C CT ACAT TAGC C AG G GAAT AC ACT GAC CT AG 1974 

I II I I I I I I I I I I I I I I I I I I I III I I I I I II I I I I I I I I I I I I I I I I I I 
Db 2872 CAT T GAT C AGTT CT AAAACT GAT T CAT T T T CTAAAT TAGC C AG GGAAT AT ACT GAC CT AG 2931 

Qy 1975 AAGT AGC C C ACAAAAGT GAAAT T G CTGAC AT C C AG GAT GGAGCT GGGTC AT T G G CT T GT G 2034 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 2932 AAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCA 2 991 

Qy 2035 C AG GAT T G C C CC AT GAC CTTTCTTT CAAGAGT AT ACAAC C T AAAG AG GAAG T T C 2088 

III I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I 

Db 2992 CAGAAT T GC C CC AT GAC CTTTCTTT GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT C A 3051 

Qy 208 9 AT GT C C C AGAT GAGT T C T C CAAAGAT AGGG GT GAT GT TT CAAAG GT GCC C GT ACT GC C T C 214 8 

I I I I I I I I I I II I I I I I I I I I III I I I I I I I I I I I I I I I II I I 
Db 3052 GTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTC 3111 

Qy 2149 CAGAT GT TTCTGCTTTG GAT GCT CAAG CAG AGAT AG G C AG CAT AGAAAAAC C CAAAGTT C 2208 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I M I I I 
Db 3112 CAGAT GT TTCTGCTTTGGC CACT CAAG C AGAGAT AGAGAG CAT AGT T AAAC C CAAAGTT C 3171 

Qy 2209 T T GT G AAAG AAG C C G AG AG AAAAC TTCCTTCT G AT AC AG AAAAAG AG C GAAG AT C T C CAT 2268 

I I II II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II Mill I I I I 
Db 3172 TTGT GAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGAT CACCAT 3231 

Qy 22 69 CT G C TAT AT T TT CAG CAGAG CT GAG- T AAAAC T T C AGTT GT T GAC CT C C T C TACT G GAGA 2 32 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 3232 CTGCTATATTTTCAGCAGAGCTGAGCTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGA 3291 

Qy 232 8 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACA 2 38 7 

I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I II I I I I I I I I I I I I I II I I I I I 
Db 3292 GACATTAAGAAGACT GGAGT GGTGTTTGGTGCCAGCCTATTCCT GCT GCTT T CATT GACA 3351 

Qy 2388 GTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATC 2447 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I Ml 

Db 3352 GTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATC 3411 



Qy 244 8 AGC T T T AGGAT AT AT AAG G GT GT GAT C C AGG CT AT C CAGAAAT C T GAT GAAGG C C AC C C A 2507 

I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 3412 AG C T T TAG GAT AT AC AAG G GT GT GAT C C AAG C TAT C CAGAAAT CAGAT GAAG G C C AC C C A 3471 

Qy 2508 TT C AGGG CAT AT T T G GAAT CT GAAGT T G CT AT AT C T GAG GAGT T GGT T C AGAAGT AC AGC 2567 

I I I I I I II I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3472 TT C AGGG CAT AT CT GGAAT CT GAAGT T GCT AT AT CT GAG GAGT T GGT TC AGAAGT AC AGT 3531 

Qy 2568 AATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTT 2 627 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I II I I II I I I I 

Db 3532 AATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTT 3591 

Qy 2628 GAT GATT T AGTT GAT T C T C T GAAGT TT G CAGT GT T GAT GT G GGT AT T TAC CT AT GT T GGT 2687 

I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 3592 GAT GATT T AGT T GAT T CT C T GAAGT T T G CAGT GT T GAT GT G G GT AT T TAC CT AT GT T G GT 3651 

Qy 2688 GCCTTGTTCAAT GGT CTGACACTACTAATTTTGGCTCT GAT TTCACTCTT CAGT GTTCCT 2747 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3652 GCCTTGTTTAAT GGT CTGACACTACTGATTTTGGCTCT CAT TTCACTCTTCAGT GTTCCT 3711 

Qy 2748 GT TAT TT AT GAAC GGC AT C AGGC G CAAAT AGAT CAT TAT CT G GGACTT GCAAATAAGAAT 2807 

I I I I I I I I I I I I I II I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 3712 GT T AT TT AT GAAC GGC AT C AGGC AC AGAT AGAT CAT TAT CTAGGACTT GCAAATAAGAAT 3771 

Qy 28 08 GT T AAAGAT GCT AT GGCT AAAAT C CAAGCAAAAAT C C CT GGAT T GAAGC GT AAAG CT GAA 2 867 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I! I I II I I I I I I II I I! I I I I I I 

Db 3772 GT T AAAGAT GCT AT G GCT AAAAT C CAAG CAAAAAT C C CT GGAT T GAAG C GCAAAG CT GAA 3831 

Qy 28 68 T GAGAAAG C CT GAAAGAGT T AACAAT AGAGGAGT T TAT C T T TAAAGGGGAT AT T CAT T T G 2 92 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3832 T GAAAAC G C C C AAAAT AAT T A GT AGGAGT T CAT CT T TAAAGG G GAT AT T CAT T T G 388 6 

Qy 2 928 AT T C CAT T G G GGAGGGT C AGGGAAGAAC AAAGC CT T GACAT T GCAGT GC AGT T T C AC 2 984 

III I I I I I I I I I II I I I I I I I 1 II II I I I I I I I I I I II I I I II I I I I I I I 

Db 38 87 AT TAT AC G G GGGAGGGT CAGGGAAGAAC GAA- C CT T GAC GT T GCAGT GCAGT T T C AC AGA 3 94 5 

Qy 2985 AGAT C T T TAT T T TT AG CAAC GCAGT G- T CT GAG GAAAAAT GAC CT GT CT T GA 3035 

I I I I II I II I I I II I I I I III M I I I I I I I I I I I I I I M I I I I I I I 

Db 394 6 TCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGA 4 005 

Qy 3036 CTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAAT 3095 

Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 

Db 4 006 CT GC CAT GT GT T CAT C AT CTT AAGT AT T GT AAGCT GCT AT GT AT GGAT TTAAACC GT AAT 4 065 

Qy 3096 CAT AT TTGTTTTT C CT GT AT GAGGC AC T G GT GAATAAACAAAGAT C T GAGAAAGC T GT AT 3155 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 4066 CATA — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC C T GT AT 4112 

Qy 3156 ATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

III I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 4113 ATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4172 

Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGT 3266 

I I I I I I I I II I I I I II I I I II I I II M I I I I I I I I I I I I 

Db 4173 AGAAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGAT 4232 



Qy 32 67 T GAT GCAG ATT TT CT GAAAT GAAA T G T T T AGAC GAG AT CAT G C C AC C AAG G C AG G A 3322 

I II I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I M M I I I I I I 

Db 4233 T GAT GCAGATT TT CT GAAAT GAAAT GTT T GT T T AGAC GAGAT CAT ACC GGTAAAGCAGGA 4292 

Qy 3323 GTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATT 3382 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 4293 ATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATT 4351 

Qy 3383 AATT GC CAATATAAGTAAAT ATAGATT AT AT AT AT CTATAT AT AGT GTTT CAC GAAGCTT 3442 

I I I II I I I I I I I I I I I I I I I I II I I I II I I I II I I I I I I I I II I I I I I I I I I 

Db 4352 AAT T GC CAATATAAGTAAAT AT AGAT TAT AT AT G TAT AGT GTTT CACAAAGCTT 4 4 05 

Qy 3443 AGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTT 34 96 

II I II I I I I I I I II I I I I I II I I I I I I I I I I I I I I M Mill I I I I 

Db 44 06 AGACCTTTACCT T - C CAGCCACCC CACAGT GCT T GAT AT TT CAGAGT CAGT CATT GGTT A 4 4 64 

Qy 3497 TAT GTGT GTAGT C C CAAAGCACATAAGCTAGGGAGAAAC GTACTT CT AGGC GCACTAC CA 3556 

II ' I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I M I I I 

Db 4 465 T AC AT GT GTAGT T C C AAAG CAC AT AAG C T AGAAGAAGAAAT AT TT C T AGGAGCACT AC C A 4524 

Qy 3557 TCTGTTTT CAACAC GAACC GAC G C CAT G CAAAC AG AACT C C - T CAAC AT AAACT T CACT G 3615 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I M I I I I I I I I 
Db 4525 TCTGTTTTCAACATGAA AT G C C AC AC AC AT AGAACT C C AAC AAC AT C AAT T T CAT T G 4581 

Qy 3616 C ACAGAC T TACT GT AGT T AAT T TT AT CAC — AAACT CT GGACT GAAT CTAAT GCTT CCAA 3673 

I I I I II I I I I I I I I I I I I I I I I I I I II II II I I I I I I I II I II I I I I I I I I I I I 

Db 4 582 CACAGACT GACT GT AGTTAATTTT GT CACAGAAT CTAT GGACT GAAT CTAATGCTT CCAA 4 641 

Qy 3674 AAA T GT T T GCAAAT AT CAAAC AT T GT T AT GT AAGAAAAT AT AAAT 3718 

Ml I I II I I I I II I I I I I I I I I I I I I M I I I I I I I I III I I I I 

Db 4642 AAAT GTTGTTTGTTT GCAAAT AT CAAAC ATT GT TAT G CAAGAAAT TAT T AATT ACAAAAT 4 7 01 

Qy 3719 GAC GAT T TAT AC AAT T GT GGT T T AAG CT GT AT T GAACT AAAT CT GT GGAAT GCAT T GT GA 3778 

II I I I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I II I II I I I I I I I I I I I 

Db 4702 GAAGAT T TAT AC CAT T GT GGT T T AAGCT GT ACT GAACT AAAT CT GT GGAAT GCAT T GT GA 4761 

Qy 377 9 AC T GT AAAAGCAAAGT AT CAAT AAAGC T TAT AGAC T TAAAAAAAAAAAAAAAAAA 3833 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II II 1 I I I II I I I III 

Db 4 7 62 ACT GT AAAAGCAAAGT AT CAAT AAAGC T TAT AGAC T TAAAAN GNNANN AGN GAAA 4816 



RESULT 9 
AF148537 
LOCUS AF148537 4632 bp mRNA linear PRI 09-SEP-2000 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

Location/Qualifiers 
1..4632 
/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
142..3720 
/note="RTN4a" 
/codon_start=1 
/product="reticulon 4a" 
/protein_id="AAG12176.1" 
/db_xref="GI:10039551" 

/translation="MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED 
EDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP 
ERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWT 
PPAPAPAAPPSTPAAPKRRGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTIS 
AGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV 
SEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKL 
VSNNILHNQQELPTALTKLVKEDEVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFER 
VWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPST 
PEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEK 
NTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEV 
TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV 
PSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPEN 
INAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSP 
DSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKP 
YLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET 
ETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 
LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKV 
LVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLS 
LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELV 
QKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 
SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE" 

signal 4605..4610 

site 4622 



Query Match 69.1%; Score 2649.8; DB 9; Length 4632; 

Best Local Similarity 85.0%; Pred. No. 0; 

Matches 3292; Conservative 0; Mismatches 442; Indels 137; Gaps 24; 



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I 

Db 853 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 912 



Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

III I I I I II I II I I I I I I I I I I I II II I I I II I I I I I I I I I I I I I I I I 

Db 913 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 972 



Qy 121 GAGAAG G C AAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T T T C AGAAT T G GAAT AT 18 0 

I I I I I I I I I I I M II I I I I I I III I M I I I I II I I II I II II I I I I I I I I 
Db 973 GAGAAG GCAAAAAC T CT ACT C AT AGAT AGAGAT T T AAC AGAGT T T T C AGAAT TAGAAT AC 1032 

Qy 181 T CAGAAAT GGAAT CAT CAT T C AGT G G C T C T CAAAAGG C AGAAC C T G C C GT AACAGT AG C G 24 0 

I I I II I I I I I I I I I II I II I I I I Mill III I I I I I I I I I I I I I I I I I I I I I 
Db 1033 T CAGAAAT G GGAT CAT C GT T C AGT GT C T CT C C AAAAG C AGAAT CT GC C GT AATAGT AG C A 1092 

Qy 241 AAT C C T AG GGAC GAAAT AGT T GT GAG G AGT AGAGAT AAAGAAGAG GAC T T AGT T AGT C T T 300 

I I I I I II I I I I MINI I I I I I I II MM I I I I I I I I I II II I II I I I 

Db 1093 AAT C CT AG GGAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AAT 1152 

Qy 301 AAC AT C CT T C AT ACT CAGC AG GAGT T AT C T AC AGT C CT T AC GAAAT CAGT T GAAGAAGAA 360 

M I I I II I I II I I Ml II I I II I I MUM I I II I I I II Ml I II I I I 

Db 1153 AACAT C C T T CAT AAT CAACAAGAGT T AC CT AC AGCT CT T ACT AAAT T G GT T AAAGAGGAT 1212 

Qy 361 GAT AGAGT T CT GT CT C C AGAAAAAACAAAGGACAGT T T T AAGGAAAAGG GAGT T G CAG C A 42 0 

II II I I I I II I I II II II I MM I II I I I I I II I I I I I I I I I I I I I I I I 

Db 1213 GA AGT T GT GT CTT C AGAAAAAGC AAAAGAC AGT T T T AAT GAAAAGAGAGT T G CAGT G 12 69 

Qy 421 GAAG CT T C TAT GG G GGAGGAAT AT GC AGAC T T CAAAC CAT T T GAG C GAGT AT GGGAAGT G 4 80 

II I I I I I I II I II II I M I II II I I II II I I II I I II I I II I I I I I I I II I I I I I M I 
Db 1270 GAAGCT C C TAT GAGGGAGGAAT AT GCAGAC T T CAAAC CAT T T GAG C GAGT AT GGGAAGT G 1329 

Qy 4 81 AAAGATACTT ACAAGCAAGATAGT GAT GTTTT GATT GCT GGAGGTAAT AT AGAGAGCAAA 54 0 

I II I II I I III II I I II I II M I Ml I II I I II I I I I I II I II I I I I I 

Db 1330 AAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCT GGAG GT AAAAT C GAGAG C AAC 138 6 

Qy 541 T T GGAAGGTAAAGT GGATAAGAAAC AC T T T T C AGAT AGC CTT GAACAAACAAAT C GT G AA 600 

II II M I I II II II I I I I I II I Ml I I II I I M II II I Mill II I I Ml 

Db 1387 T T G GAAAGT AAAGT GGAT AAAAAAT GT T T T G C AGAT AGC C T T GAG CAAAC T AAT C AC GAA 1446 

Qy 601 AAAGATAGT GAAAG CAGT AAT GAT GAC ACT T CAT T T C C CAGT AC AC C AGAAG CT GTAAGA 660 

I I II I II I I I I II II M I I I I II I II I II II I I I I II II I II II I I I Ml 
Db 1447 AAAGATAGT GAGAGT AGTAAT GAT GATACTT CTTT C CCCAGT ACGCCAGAAGGT ATAAAG 1506 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AACT GAGAAT GT T 717 

I I I I I I I I I I I II I I II I II I II I I II I I I I I I I II I I I M I II I II 

Db 1507 GAT C GT T C AGGAGC AT AT AT C ACAT GTGCTCCCTT T AAC C CAGC AGC AAC T GAGAGC AT T 1566 

Qy 718 T CAACAAACATTTTTCCCTT GTT GGAAGAT CATACTT CGGAAAATAAGACAGAT GAAAAA 777 

II I I I I I I I I II II I I I II II I Mill I I I I I I I II II I II I II I II II I I I I 
Db 1567 G CAACAAACAT T T T T C CT TT GT T AGGAGAT C C T AC T T C AGAAAAT AAGAC C GAT GAAAAA 1626 

Qy 77 8 AAGAT AGAA- AAAAAAAGGC ACAAAT T GT AAC AGAGAAGAAT GCAAGT GT CAAGACAT C A 836 

II MUM MM I I I I I I II I I II I I I I I I I I I I I II I II III I I M I I 

Db 1627 AAAAT AGAAGAAAAGAAG GC C CAAAT AGT AAC AGAGAAGAAT ACT AG C AC CAAAACAT C A 168 6 

Qy 837 AAC C CT T T C CT T AT GG C AGC AC AG GAGT C T AAGACAGAT T AC GT T ACAAC AGAT CAT GT G 896 

I I I M II I Ml I I I I I II II I I I III I II II I I II I II I I I I I I II I II I 
Db 1687 AAC CCTTTTCTT GT AG CAGC AC AG GAT T C T GAGACAGAT TAT GT C ACAAC AGAT AATT T A 1746 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I I II I I II I II I I I I I I II I II II I I I I I II I I II I II II II I I I I I I II 
Db 1747 ACAAAGGT GACT GAGGAAGT C GT G GCAAAC AT G C CT GAAG G C C T GACT C C AGAT T TAGT A 1806 

Qy 957 CAG GAAGC AT GT GAAAGT GAAT T GAAT GAAG CT ACT G GT AC AAAAAT T G C C T T T GAAAC A 1016 



I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 1807 CAGGAAGCAT GT GAAAGT GAATT GAAT GAAGTTACT GGTACAAAGATT GCT T AT GAAACA 1866 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I Mill 
Db 1867 AAAAT GGACTT GGTTCAAACATCAGAAGTTATGCAAGAGT CACTCTAT CCT GCAGCACAG 1926 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I II I I I I I I I I I II I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1927 CT T T GC C CAT C ATT T GAAGAGT CAGAAG CT ACT C C T T C AC C AGT T T T G C C T GAC AT T GT T 198 6 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I MM III 
Db 1987 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 04 6 

Qy 1197 T CAT CAC CAT T AGAAACT CT T C CT T C AGT TAAT TAT GAAAG CAT AAAGT T T GAGC CT GAA 1256 

I II I I I I II I I I II I I I I M I I I I I I II I I I I M II I I II I I I II II II I I I 

Db 2047 T CAT CAC CAT T AGAAG C T T C T T C AGT TAAT TAT GAAAG C AT AAAAC AT GAG CCT GAA 2103 

Qy 1257 AATCCCCCAC CAT AT GAGGAGGCCAT GAAT GT AT CACT AAAAAAAGAAT CAGGAAT GAAT 1316 

II I I I M I I I I I I II I I II I I I I M I I I I I I I II I II I M II I I II I I I II I II 

Db 2104 AACC CCCCAC CAT ATGAAGAGGCCAT GAGTGTAT CACTAAAAAAAGTAT CAGGAAT AAAG 2163 

Qy 1317 GAAGAAAT C AC AGAGC CT GAAGGT AT T AGT GT AGCT GT T C AG GAAAC AGAAGCT C C TT AT 1376 

I II I I I I I I I II I I I I M I I II II M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2164 GAAGAAAT T AAAGAGC C T GAAAAT AT TAAT GCAG CT C T T CAAGAAACAGAAGC T C CTT AT 2223 

Qy 1377 AT AT CTATT GCATGTGATTTAATTAAAGAAACAAAGAT CT CTACT GAAC CGACT CCAGAT 1436 

I II I I I I II I I I I I I II I I II I I II I I I I II I I I II I III I I II II I I I II III 
Db 2224 AT AT CT AT T G C AT GT GATT T AAT T AAAGAAACAAAGCT T T CT G CT GAAC C AGCT C C GG AT 22 8 3 

Qy 1437 T T CT CT AGT T ATT CAGAAAT AG C AGAAGT T GCAC AGC C AGT GC CC GAGC AT T C T GAGCT A 14 96 

I I I I I I I I I I II I I I I I I III MIMI I I I I II II I I I I II I I I II II I I I II 
Db 2284 T T CT CT GAT TAT T CAGAAAT GG CAAAAGT T GAAC AGC C AGT GC CT GAT CAT T CT GAGCT A 234 3 

Qy 1497 GT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GACT T AT T T AGT GAT GAT T CAAT AC C C 1556 

I II I I II I I I I II I II I I I I I I II I II I I I I II I I I II I II I I I I I I I II I I II II I 

Db 2344 GT T GAAGAT T C CT C AC CT GAT T CT GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC CT 24 03 

Qy 1557 GAAGTT C C ACAAAAAC AAGAT GAAGC T GT AAT AC T T GT GAAAGAAAAC C T CACT GAAAT T 1616 

II I I I I I I I I I I I I I I I I I II I I I I I I II I I I II II I I II II I I I I I I I I I I 

Db 2404 GAC GTT C C ACAAAAACAAGAT GAAACT GT GAT GC T T GT GAAAGAAAGT CT CAC T GAGAC T 2463 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676

I I I I I I II I I II I I I II I I I II II I I I I I I I I II I I I I I I II I I Mill 

Db 2464 T CATTT GAGT CAAT GAT AGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 2520 

Qy 1677 GAG GGAG GAAAAC C GT AT T T G GAGT C T T T T C AGC C CAGT T T AGGC AT C ACAAAAGAT AC C 1736 

II M I II II I I II II II I I II I I I I M III I II II I I I I II I II I I I II I I I 

Db 2 521 GAGGGAG GAAAGC C AT AT T T G GAAT C T T T T AAGC T CAGT T TAG AT AAC ACAAAAGAT AC C 258 0 

Qy 1737 T TAG CAC C T GAT GAAGT TT C AGC AT T GAC C CAAAAGGAGAAAAT C C CT TT G C AGAT GGAG 17 96 

I M II I I II I II I I I II MIMI I I II I M M I I I I I I II M II I II I I I I I 

Db 2581 CT GT T AC CT GAT GAAGT TT C AAC AT T GAG CAAAAAGGAGAAAAT T C C T TT G CAGAT GGAG 2640 



Qy 1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856

II M I M I I M II II I I I I II M I I I I II MIMI Ml II I I II II I I I M 



Db 2641 GAGCT CAGT AC T GC AGT T TAT T CAAAT GAT GAC T TAT T TAT T T CT AAGGAAGC AC AGAT A 27 00 

Qy 1857 AGAGAAAGT GAAACAT T T T C AGAT T CAT C T C C GAT T GAGAT TAT AGAT GAGT T C C C GAC C 1916 

I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 27 01 AGAGAAAC T GAAAC GT T T T C AGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C CT AC A 2760 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 197 6 

II I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2761 T T GAT CAGT T C T AAAACT GAT T CAT T T T CT AAAT T AGC C AG G GAAT AT AC T GAC CTAGAA 2820 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2 821 GT AT C C C ACAAAAGT GAAAT T GC T AAT G C C C C G GAT G GAGCT GGGT CAT T G C C T T GC AC A 28 80 

Qy 2037 GGAT T GC C C CAT GAC CT T T C T T T CAAGAGT AT ACAAC C TAAAGAGGAAGTTCAT 2 0 90 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

Db 28 81 GAAT T G C C C CAT GAC CT T T C T T T GAAGAAC AT AC AAC C C AAAGT T GAAGAG AAAAT CAGT 294 0 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I I I I I II II III II II II I I I I I II I I M I II I II I I I II 
Db 2941 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 3000 

Qy 2151 GAT GTTTCTGCTTTG GAT GCT CAAG C AGAGAT AGG CAG CAT AGAAAAAC C CAAAGT T C T T 2210 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3001 GAT GTTTCTGCTTTGGC CAC T CAAG CAGAGAT AG AGAG C AT AGT T AAAC C C AAAGT T C T T 3060 

Qy 2211 GT GAAAGAAGC C GAGAGAAAAC TT C C T T C T GAT AC AGAAAAAGAGC GAAGAT CT C CAT C T 227 0 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 3061 GT GAAAGAAGCT GAGAAAAAAC TT C CT T C C GAT AC AGAAAAAGAG GAC AGAT CAC CAT C T 3120 

Qy 2271 G CT ATAT TT T C AGC AGAG CT GAGT AAAACTT CAGT T GT T GAC C T C C T CT ACT G GAGAGAC 2330 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I II I I I II I 
Db 3121 G CTAT AT T T T C AGCAGAG C T GAGT AAAACTT C AGTT GT T GAC C T C CT GT ACT G GAGAGAC 3180 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 3181 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 324 0 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 50 

I I I I I I I I I I I I I I II I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 32 41 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 33 00 

Qy 24 51 T T T AGGAT AT AT AAG GGT GT GAT C CAG G C TAT C C AGAAAT CT GAT GAAGG C CAC C CAT T C 2510 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 3301 T T T AGGAT AT AC AAGG GT GT GAT C CAAG CTAT C C AGAAAT C AGAT GAAG G C CAC C CAT T C 3360 

Qy 2511 AG GGCAT AT T T G GAAT C T GAAGTT GC T AT AT CT G AGGAGT T G GT T C AGAAGT AC AG CAAT 257 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I Ml 
Db 3361 AG GG CAT AT CT G GAAT C T GAAGT T GC T AT AT CT GAGGAGT T G GT T C AGAAGT AC AGT AAT 3420 

Qy 2571 TCTGCTCTT GGT CAT GT T AAC T GCACAAT AAAAGAAC T CAGAC GC C T C T T CT T AGT T GAT 2 630 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 3421 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 34 80 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 3481 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 354 0 



Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3541 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3600 

Qy 2751 AT T TAT GAAC GGCAT C AG G C G CAAAT AGAT CAT TAT CT GGGAC T T GCAAAT AAGAAT GT T 2810 

I I I I II I I I I I I I I I I I II I II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3601 AT T TAT GAAC GG CAT C AGG CAC AGAT AGAT CAT TAT C TAG GAC T T G CAAAT AAGAAT GT T 3660 

Qy 2811 AAAGAT G CT AT G GCT AAAAT C C AAG C AAAAAT C C CT G GAT T GAAGC GT AAAG C T GAAT GA 2 870 

I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3661 AAAGAT G CT AT GGCT AAAAT C CAAG CAAAAAT C C CT GGATT GAAG C G CAAAGCT GAAT GA 372 0 

Qy 2871 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 2930 

Mill I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3721 AAAC GC C CAAAATAAT TAGT AG G AGT T CAT C T T T AAAG G G GAT AT T CAT T T GAT T 3775 

Qy 2931 C CAT T G GGGAGG GT C AGGGAAGAAC AAAG C CT T GACAT T GCAGT G C AGT T T CAC AGAT CT 2 990 

II I I I I I 

Db 3776 AT ACGGATCT 3785 

Qy 2991 TTATTTTTAGCAAC GCAGT G-TCTGAGGAAAAAT GAC CTGTCTTGACTGCCCTGTGTTCA 304 9 

I I I I I I II I I I I III II I I I I I I I I I I I i I I II I 1 I I I I I I I I I I I I I I I I I I 
Db 3786 TTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCA 3845 

Qy 3050 TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTC 3109 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3846 T CAT CT TAAGT AT T GTAAGC T GCT AT GT AT G GAT T T AAAC C GTAAT CAT A — TCTTTTTC 3903 

Qy 3110 CT GT AT GAG GCACT GGT GAAT AAAC AAAGAT C T GAGAAAGC T GT AT AT T ACACT T T GT C G 3169 

I I I I I I I I I I I I I I II I I I I II I I II I I I I I I I I I I I I I 

Db 3904 CT AT CT GAGGCACTGGT GGA ATAAAAAACCT GT AT AT T T T AC T T T GT T G 3952 

Qy 3170 CAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAA 3222 

I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 3953 CAGATAGT CTT GCCGCAT CTTGGCAAGTT GCAGAGAT GGTGGAGCTAGAAAAAAAAAAAA 4 012 

Qy 3223 TAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTG 32 82 

II I II I I I I I I I I I I I I i I I I I I I I I II I I I I I I I I II I I I I I M I II I 
Db 4013 AAAAGCCCTTTTCAGTTTGT GCACT GTGTAT GGT CCGTGTAGATT GAT GCAGATTTTCTG 4072 

Qy 3283 AAATGAAA T GT T T AGAC GAG AT CAT G C CAC CAAG GC AGGAGT GAAAAAG C T T G C C T 3338 

I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I III I I I I I I I I II 

Db 4 073 AAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C GGT AAAGC AGGAAT GACAAAGC T T G - CT 4131 

Qy 3339 TTCCTGGTATGTTCTAGGTGTATTGTG7WVTTTACTGTTGTATTAATTGCCAATATAAGT 3398 

II I I I I I I I M I I M I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I 

Db 4132 TTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGT 4191 

Qy 3399 AAAT AT AGAT TAT AT AT AT CT AT AT AT AGT GT T T CAC GAAG CT T AGC C C T T T AC CT T C C C 34 58 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I II 

Db 4192 AAAT AT AGAT TAT AT AT G TATAGTGTTTCACAAAGCTTAGACCTTTACCTT-CC 4244 

Qy 3459 AG CT GC C C C ACAGT GC T T GAT AC T T CT GT CAT GGGTTTT AT GT GT GT AGT CC CA 3512 

III I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 4245 AG C CAC C C C ACAGT GCT T GAT AT T T C AGAGT C AGT CAT T G GT TAT AC AT GT GT AGT T C C A 4304 



Qy 3513 AAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGA 3572 

I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 4305 AAGC ACAT AAG C T AGAAGAAGAAAT AT T T C TAG GAGC ACT AC CAT C T GT T T T CAAC AT GA 4364 

Qy 3573 AC C GAC G C CAT GC AAAC AGAACT C C - T C AAC AT AAAC T T C AC T GC ACAGACT TACT GT AG 3631 

I I I I I I I I I I I I I I I I I i II II I II I I I I I M I II I I I I I I I I I I I 
Db 4365 A AT G C CAC ACAC AT AGAAC T C C AACAAC AT C AAT T T CAT T GCAC AGACT GACT GT AG 4421 

Qy 3632 T T AAT T T TAT CAC — AAACTCTGGACTGAATCTAATGCTTCCAAAAA TGTTTG 3682 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 4422 T T AAT T T T GT C ACAGAAT CT AT G GACT GAAT C T AAT GCT T C C AAAAAT GT T GT T T GT T T G 4481 

Qy 368 3 CAAATATCAAACATTGTTATGTAAGAAAATAT AAAT GAC GAT T TAT AC AAT T 3734 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I II I I I II III 

Db 44 82 C AAAT AT C AAAC AT T GT TAT G C AAG AAAT TAT T AAT T AC AAAAT GAAGAT T TAT AC CAT T 4541 

Qy 3735 GT GGT T T AAG C T GT AT T GAAC TAAAT C T GT G GAAT G CAT T GT GAACT GT AAAAG C AAAGT 3794 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 4 542 GT GGT T TAAG C T GT ACT GAACT AAAT CTGTG GAAT GC AT TGT GAACT GT AAAAG CAAAGT 4601 

Qy 3795 AT C AAT AAAGC T T AT AGAC T T AAAAAAAAAA 3825 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 4 602 AT C AAT AAAG C T TAT AGACT T AAAAAAAAAA 4 632 
AX195249 4053 bp RNA linear PAT 28-AUG-2001

Sequence 5 from Patent WO0151520. 

AX195249 

AX195249.1 GI:15385809

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Strittmatter,S.M.

Nogo receptor-mediated blockade of axonal growth 
Patent: WO 0151520-A 5 19-JUL-2001; 
YALE UNIVERSITY (US) 

Location/Qualifiers

1..4053

/organism="Homo sapiens"
/mol_type="unassigned RNA"
/db_xref="taxon:9606"
135..3713

/note="unnamed protein product; Human mRNA for Nogo

protein (KIAA0886, GenBank Accession No. AB020693)"

/codon_start=1

/protein_id="CAC59983.1"

/db_xref="GI:15385810"

/db_xref="REMTREMBL:CAC59983"

/translation="MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED
EDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP
ERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWT
PPAPAPAAPPSTPAAPKRRGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTIS
AGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV
SEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKL
VSNNILHNQQELPTALTKLVKEDEVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFER
VWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPST
PEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEK
NTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEV
TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV
PSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPEN
INAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSP
DSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKP
YLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET
ETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE
LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKV
LVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLS
LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELV
QKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI
SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE"

ORIGIN 

Query Match 62.1%; Score 2379.4; DB 6; Length 4053; 

Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I II I II I I I I I I I I II I I I II I I I I I I I I I I I II I I I I II II 
Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905

Qy 61 GT AC T GC C C AC T GAAG GAAC ACT T C C AG C AACT T CAAAT GAAG CT T CT AAAGC AT T C T C A 12 0 

I I I I I I I I I I I I I I I I I I I II I I II II I I II I I I I I I I I I I I I I I I I I 
Db 906 GT AT TAG C CACT GAAGGAAC ACT T CAAGAAAAT GT C AGT GAAGCT T CTAAAGAGGT CT C A 965 

Qy 121 G AG AAG G C AAAAAAT C CAT T T GT AG AG AG AAAT T T AAC AG AAT T T T C AG AAT T G G AAT AT 18 0 

I I I I I I I I I I I I I II I I I II I III I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 966 GAGAAGGCAAAAACT CT ACT C AT AGAT AGAGAT T T AAC AGAGT TT T C AGAAT T AGAAT AC 1025 

Qy 181 T CAGAAAT G GAAT CAT CAT T CAGT G GCT CT CAAAAGGC AGAAC CT GC C GTAAC AGT AG C G 24 0 

I I I M I I I I I I I I I I I I II I I II I I I I I III I I II I I I I I I I I I I I I I I I I I 
Db 1026 T CAGAAAT GGGAT C AT CGT T CAGT GT C T C T C CAAAAG C AGAAT CT GC C GT AAT AGTAGC A 108 5 

Qy 241 AAT C CT AG G GAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAG GACT T AGT T AGT C T T 3 00 

I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I II I I I I II I 
Db 108 6 AAT C C T AGG GAAGAAAT AAT CGT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AAT 1145 

Qy 301 AAC AT C C T T CAT ACT C AG C AG GAGT T AT CT ACAGT C C T T AC GAAAT CAGT T GAAGAAGAA 360 

I I I I I I I I I I I II III II I I I I I I I I I I I I I I I I I I I I I III I I I I II 
Db 114 6 AAC AT C CT T CAT AAT CAACAAGAGT T AC CT ACAG CT C T T ACTAAATT GGT T AAAGAGGAT 12 05 

Qy 361 GAT AGAGT TCTGTCTC C AGAAAAAACAAAGGAC AGT T T T AAG GAAAAGGGAGT T GCAG C A 42 0 

M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I 
Db 1206 GA AGT TGTGTCTT C AGAAAAAG C AAAAGAC AGT T T T AAT GAAAAGAGAGT T G CAGT G 1262 

Qy 421 GAAGCT T CT AT GGG G G AGGAAT AT G C AGAC T T CAAAC CAT T T GAG CGAGT AT G GGAAGT G 48 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I M I I I I || I I I I I I I I I I I I II I 
Db 12 63 GAAGC T C C TAT GAG G GAG GAAT AT GC AGACT T CAAAC CAT T T GAG C GAG TAT G G GAAGT G 1322 



Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540
I I I I I I I I Ml I I I I I II I I I I I III I I I I I I I I I I I I II I I I I I I I I 



Db 1323 AAAGATAGT AAG GAAGAT AGT GAT AT GTTGGCTGCTG GAG GT AAAAT C GAGAGC AAC 1379 

Qy 541 T T G GAAGGT AAAGT G GAT AAGAAAC ACT T T T C AGAT AGC CT T GAAC AAAC AAAT C GT GAA 600 

MINI I I I I I I I I I I I I I III III I I I I I I I I I II I I I I I I I I I I I III 
Db 1380 T T G GAAAGT AAAGT G GAT AAAAAAT GT T T T G C AGAT AGC C T T GAG C AAAC T AAT CAC GAA 1439 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660

I I M I I II I I I II I I I I I I I I I M I I I I I II I I I I I I I I I I I I I II I III 
Db 144 0 AAAGATAGT GAGAGT AGTAAT GAT GAT ACT T CT T T C C C C AGT AC G C C AGAAG GT ATAAAG 14 9 9 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AAC T G AGAAT GT T 717 

I I I I I I I I I I I II I I I II I I II I I I I I I I I I I I I II I I I || I I I I || 
Db 1500 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1559

Qy 718 T CAACAAAC AT TTTTCCCTTGTT GGAAGAT CAT ACT T C G GAAAAT AAGAC AGAT GAAAAA 777 

I I I I I I I I I I I I I I I I I I I I I | I I I M I I I I II I I I I I I I I I I I I I I I I II I I 
Db 1560 GC AAC AAACAT TTTTCCTTTGT TAG GAGAT C CT ACT T CAGAAAAT AAGAC C GAT GAAAAA 1619 

Qy 77 8 AAGAT AGAA- AAAAAAAGGC ACAAAT T GT AACAGAGAAGAAT G C AAGT GT CAAGAC AT C A 836 

M I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I M I I II III I I I I I I 
Db 162 0 AAAAT AG AAG AAAAGAAG GC C C AAAT AGT AAC AGAGAAGAAT AC TAG CAC C AAAAC AT C A 1679 

Qy 837 AAC C CT T T C CT TAT GG C AGC AC AG GAGT CT AAGAC AGAT T AC GT T ACAAC AGAT CAT GT G 896 

I I I I I I I I I M I I I I I I II I I I I III I II I I I I I I I II I I I I I I I I I II I 
Db 168 0 AAC CCTTTTCT T GT AGCAGC AC AGGAT T C T GAGAC AGAT TAT GT CACAAC AG AT AAT T T A 1739 

Qy 897 T CAAAGGT GAC CGAGGAAGT AGT GGC AAAC AT G C C T GAAG GT C T AAC C C C AGAT T T GGT T 956 

I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I || 
Db 1740 ACAAAGGT GACT GAGGAAGT CGT GGCAAAC AT GC C T GAAG GC C T GAC T C CAGAT TT AGT A 1799 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 18 00 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGT T AC T G GT AC AAAGAT T G C T TAT GAAAC A 1859 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I Mill 
Db 1860 AAAAT GGACT T GGT T CAAAC AT C AGAAGT TAT GC AAGAGT C ACT CT AT C CT GC AGCACAG 1919 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M 
Db 192 0 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1979 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I III 
Db 1980 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2039 

Qy 1197 T CAT CAC CAT T AGAAAC T CT T C CT T C AGT T AAT TAT GAAAGC AT AAAGT T T GAG C C T GAA 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 204 0 T CAT CAC CAT T AGAAGC T T CT T C AGT T AAT TAT GAAAG CAT AAAAC AT GAGC CT GAA 2096 

Qy 1257 AAT CCC C CAC CAT AT GAGGAGGCCAT GAAT GTAT CACTAAAAAAAGAAT CAGGAAT GAAT 1316 

II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I II I I I I I I M 

Db 2097 AAC C C C C CAC CAT AT GAAGAGGC CAT GAGT GTAT CAC TAAAAAAAGT AT C AG GAATAAAG 2156 

Qy 1317 GAAGAAAT C ACAGAGC C T GAAG GTAT T AGT GT AG CT GT T C AG GAAAC AGAAGCT C CT T AT 1376 

I I I I I I I I I I I I I I I I I I I I I I I I II | | | | | | I I II I I I I I I I I I I I I I I I I 
Db 2157 GAAGAAAT T AAAGAG C CT GAAAAT AT T AAT G C AG C T C T T CAAGAAAC AGAAG CT C CT T AT 2216 



Qy 1377 ATAT CTATT GCAT GT GAT TTAATTAAAGAAACAAAGAT CT CTACT GAAC C GACT CCAGAT 1436 

I M I I II I I I I I I I I I I I I I I I I I I | | | | I | I M I I I III II I I I II I I I I IN 
Db 2217 AT AT CT AT T G CAT GT GAT T T AAT T AAAGAAAC AAAGCT T T C T G CT GAAC C AG C T C C G GAT 2276 

Qy 1437 T T CT CT AGT TAT T C AGAAAT AG C AGAAGT T G CAC AGC C AGT G C C C GAG CAT T C T GAGC T A 1496 

I I I I I I II I I I II I I I I I II I I I I I I I I II II I I I I I I I II II II I I I I M I I 
Db 22 77 T T C T CT GATT AT T C AGAAAT G GCAAAAGT T GAACAGC C AGT G C CT GAT CAT T C T GAGC T A 233 6 

Qy 14 97 GTT GAAGATT CCT CC CC C GATT CT GAACCAGTT GACT TATTT AGT GAT GATT CAATACCC 1556 

M I I I I I I I I I I II II I I I I I I II I I I II I I I I I M I I I I I I I I I I II I I I II M I I 
Db 2337 GTT GAAGAT T C C T CAC C T GAT T C T GAAC C AGT T GAC T TAT T TAG T GAT GAT T C AAT AC C T 2396 

Qy 1557 GAAGTT C C ACAAAAACAAGAT GAAGCT GT AAT ACT T GT GAAAGAAAAC CT CAC T GAAATT 1616 

I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I II I I I I 
Db 2397 GACGTT CCACAAAAACAAGAT GAAACT GT GATGCTTGTGAAAGAAAGT CT CACT GAGACT 2456 

Qy 1617 T CAT C T GAGT CAAT GAC AGGACAT GAC AAT AAGGGAAAAC T C AGT G CT T CAC CAT CAC C T 1676 

MM II II I I I II II II I I M I I I I I II I I II I I I I I I II I II I M I II 
Db 2457 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 2513 

Qy 1677 GAGG GAGGAAAAC C GT AT T T GGAGT C T T T T CAG C C CAGT TT AG GC AT C ACAAAAGAT AC C 173 6 

M II II II II I I I I I I I II I I MIMI III MINIM I I I I II I I I M I I I 

Db 2514 GAGGGAG GAAAGC C AT AT T T GGAAT CT T T TAAG C T C AGT T TAG AT AAC ACAAAAGAT AC C 2573 

Qy 1737 T T AGCAC CT GAT GAAGTT T C AGC AT T GAC C CAAAAG GAGAAAAT C C C T T T GC AGAT G GAG 1796 

I II I I M I II II M II I I I I I I I I I I I I II II II II I I II I II II I I II I I I 
Db 2574 CT GT T AC CT GAT GAAGTT T C AAC AT T GAG C AAAAAG GAGAAAATT C C T T T GC AGAT G GAG 2633 

Qy 17 97 GAGCT CAAT ACT GC AGTT T AT T CAAGT GAT G GCT T AT T CAT T GCT CAGGAAG CAAAC CT A 1856 

M I II I I I II I I I I I II II I II II Mill I I II I I III II II I I II 1 I I M 
Db 2 634 GAGCT CAGT ACT GCAGTTTATT CAAAT GAT GACTTATTTATTT CTAAGGAAGCACAGAT A 2693 

Qy 1857 AGAGAAAGT GAAACAT T T T C AGAT T CAT CT C C GAT T GAGAT TAT AGAT GAGT T C C C GAC C 1916 

M II II II I II I II I II I I I II I I I Mill II II II II M I I I I II I II 

Db 2694 AGAGAAAC T GAAAC GT T T T C AGATT CAT C T C CAAT T GAAAT TAT AGAT GAGT T C C CT AC A 2753 

Qy 1917 T T T GT CAGT T C TAAAGC AGATT CT T CT C C T ACAT T AGC C AGGGAAT AC ACT GAC C TAGAA 1976 

M II II II II I I I I I I I I I I I I M | | | | | || M II I II I I II I I I I I II I I 
Db 2754 TT GAT CAGT TCTAAAACT GATT CAT TTTCTAAATTAGCCAGGGAAT AT ACT GACCTAGAA 2 813 

Qy 1977 GT AG C C C ACAAAAGT GAAAT T G CT GAC AT C CAG GAT GGAGCT GG GT CAT TGGCTTGTG C A 2036 

Ml II I II II I II II II I I II II I II I I I I I I I I I || | | | | | | | | MM II 
Db 2814 GTATCCCACAAAAGT GAAAT TGCTAATGCCCCGGAT GGAGCT GGGTCATTGCCTTGCACA 2873 

Qy 2037 GGAT T GC C C CAT GAC C T T T C T TT C AAGAGT AT ACAAC C T AAAGAG GAAGT T CAT 2090 

I I M II II II I II II I II II I I M II M I I II II I I I M I II I I 

Db 2874 GAAT T G C C C CAT GAC C T T T CT T TGAAG AAC AT ACAAC C CAAAGT T GAAGAG AAAAT CAGT 2 933 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 215 0 

M I II II M II II II I I I I I Ml I II II II II I I II II M I II I 
Db 2934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2993 

Qy 2151 GAT GTTTCTGCTTTG GAT GCT CAAGC AGAGAT AGG C AG CATAGAAAAAC C CAAAGT T C T T 2210 

M I I II I M I II I II II I II I I I I I I I I M I II II I II I II 

Db 2 994 GAT GTTTCTGCTTTGGC CAC T CAAGC AGAGAT AGAGAGC AT AGT TAAAC C CAAAGT T C T T 3053 



Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270
        ||||||||||| |||| |||||||||||| |||||||||||||||   ||||| ||||||
Db 3054 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 3113

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330
        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 3114 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3173

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390
        ||||||||||||||||||||||||||||||||| | ||||||||||| ||  ||||||||
Db 3174 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3233

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450
        |||||||||||||| ||||| |||||||||||||||||||||||||||||||| ||||||
Db 3234 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3293

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510
        ||||||||||| |||||||||||||| |||||||||||||| ||||||||||||||||||
Db 3294 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3353

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570
        ||||||||| |||||||||||||||||||||||||||||||||||||||||||||| |||
Db 3354 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3413

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630
        ||||||||||||||||| |||||||| |||||| |||||||| |||||||||||||||||
Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3474 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533

2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

INN I I I I I I I I I I II I I I M | |M M | | | | | || | | | || | | || | | | | | | 

3534 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 35 93 

2751 AT T TAT GAAC GG CAT CAG GC GCAAAT AGAT CAT TAT CT GG GAC T T G CAAAT AAGAAT GT T 2810 

I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I || | I || | || | | 
3594 AT T TAT GAAC GG C AT CAG GC ACAGAT AGAT CAT TAT CTAG GACT T G CAAAT AAGAAT GT T 3653 

2 811 AAAGAT G C TAT G G CTAAAAT CCAAGCAAAAAT C C CT GGAT T GAAG C GT AAAGCT GAAT GA 2 870 

HI I I I I I M I I I I I I I I I I I I I I I I I II I M I I II I I I I I I I I I I II 

3654 AAAGAT G C TAT G G CTAAAAT CCAAGCAAAAAT C C CT GGAT T GAAG C GCAAAGCT GAAT GA 3713 

2 8 71 GAAAGC C T GAAAGAGTTAACAAT AGAG GAGT T TAT C T T T AAAGGG GAT AT T CATT T GAT T 2 930 

II 'I' I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
3714 AAAC GC C C AAAAT AAT T A GT AGGAGT T CAT C T TT AAAGGG GAT AT T CATT T GAT T 3768 

2931 C CAT T G GG GAG G GT C AG GGAAGAAC AAAG C C T T GAC AT T GC AGT GC AGT T T C AC 2 984 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
37 69 AT AC GGG G GAGGGT CAG G GAAGAAC GAA- C CT T GAC GT T G CAGT G CAGT T T CAC AGAT C G 3827 

2985 AGAT CT T TAT T T T TAG CAAC G CAGT G- T C T GAGGAAAAAT GAC CT GT C T T GACT G 3038 

I I I I I I I I I I I I II I I I Mill I I I II I I I II I I I I 

3828 T T GT T AGAT CT T TAT T T T TAG C CAT G C ACT GT T GT GAGGAAAAAT T AC C T GT CT T GACT G 38 87 

Qy   3039 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 3098
          || ||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db   3888 CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 3947

Qy   3099 ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 3158
          |  | |||||||| | ||||||||||||| |            | | ||| |||||||||
Db   3948 A--TCTTTTTCCTATCTGAGGCACTGGTGGA-----------ATAAAAAACCTGTATATT 3994

Qy   3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214
            ||||||| |||| ||||||||| | || ||||  | ||||| |||  ||||||||
Db   3995 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4051
LOCUS       AB020693     4053 bp    mRNA    linear   PRI 16-JUN-1999
DEFINITION  Homo sapiens mRNA for KIAA0886 protein, complete cds.
ACCESSION   AB020693
VERSION     AB020693.1  GI:4240260
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Nagase,T., Ishikawa,K., Suyama,M., Kikuno,R., Hirosawa,M.,
            Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O.
  TITLE     Prediction of the coding sequences of unidentified human genes.
            XII. The complete sequences of 100 new cDNA clones from brain which
            code for large proteins in vitro
  JOURNAL   DNA Res. 5 (6), 355-364 (1998)
  MEDLINE   99156230
   PUBMED   10048485
REFERENCE   2  (bases 1 to 4053)
  AUTHORS   Ohara,O., Suyama,M., Kikuno,R., Nagase,T. and Ishikawa,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-DEC-1998) Osamu Ohara, Kazusa DNA Research Institute,
            Laboratory of DNA Technology; Yana 1532-3, Kisarazu, Chiba
            292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913,
            Fax:+81-438-52-3914)
FEATURES             Location/Qualifiers
     source          1..4053
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="hk07722"
                     /sex="male"
                     /tissue_type="brain"
                     /clone_lib="pBluescriptII SK plus"
                     /dev_stage="adult"
     gene            1..4053
                     /gene="KIAA0886"
     CDS             135..3713
                     /gene="KIAA0886"
                     /codon_start=1
                     /product="KIAA0886 protein"
                     /protein_id="BAA74909.1"
                     /db_xref="GI:4240261"
                     /translation="MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED
                     EDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP
                     ERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWT
                     PPAPAPAAPPSTPAAPKRRGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTIS
                     AGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV
                     SEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKL
                     VSNNILHNQQELPTALTKLVKEDEVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFER
                     VWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPST
                     PEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEK
                     NTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEV
                     TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV
                     PSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPEN
                     INAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSP
                     DSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKP
                     YLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET
                     ETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE
                     LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKV
                     LVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLS
                     LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELV
                     QKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI
                     SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE"
ORIGIN



Query Match 62.1%; Score 2379.4; DB 9; Length 4053; 

Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 
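
The summary figures above appear to be related by simple arithmetic: 2802 matches out of 2802 + 0 + 381 + 54 = 3237 aligned columns is 86.6%, which is the reported Best Local Similarity. The sketch below reproduces that check. It assumes (the report does not state this) that the similarity is the match count divided by the sum of matches, conservative substitutions, mismatches and indels; the function name is hypothetical.

```python
def local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts taken from the summary lines of this result.
print(round(local_similarity(2802, 0, 381, 54), 1))
```

Under the same assumption, any other result in the listing can be checked by substituting its own counts.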



Qy      1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
          || |||||||||||||||||| |||| ||||||| ||||||||||||| ||||  || ||
Db    846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905

Qy     61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120
          ||| | ||||||||||||||||||| || || |   | ||||||||||||||   |||||
Db    906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965

Qy    121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180
          ||||||||||||| || | |  |||| ||| |||||||||| ||||||||||| |||||
Db    966 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1025

Qy    181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240
          |||||||||| |||||| ||||||| ||||| ||| |||||| ||||||||| ||||||
Db   1026 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1085

Qy    241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300
          ||||||||||| |||||| | ||||  | || |||| |||||||| | |||||||||  |
Db   1086 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1145

Qy    301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360
          ||||||||||||| ||| || |||||| ||||||  ||||| ||||  ||| |||| ||
Db   1146 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1205

Qy    361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420
          ||   |||| ||||| |||||||| |||| ||||||||||| |||||| |||||||||
Db   1206 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1262

Qy    421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480
          |||||| ||||| |||||||||||||||||||||||||||||||||||||||||||||||
Db   1263 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1322



481 AAAGAT ACTT ACAAGCAAGATAGT GATGTTTT GAT T GCT GGAGGT AATATAGAGAGCAAA 540 

I I I I I I I I II I I | | M | | IN | | M I I I I I I I I M II I I I I I I 

1323 AAAGAT AGT AAGGAAGATAGT GAT AT GTT GGCT GCT GGAGGTAAAAT CGAGAGCAAC 1379 



541 



TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

I I I I I I M I M I I I I I I I I III I I I I I II I I I I Mill INI III 

1380 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 1439 



601 
1440 



AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

minimi m i i i i i i i i i m i i i i i i i i i i i i i ii i i i i m i i m 

AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1499 



661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

' 1 UN I II II II I I I I I II I I II I I I II II I M II I I I II I II I II 

GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 



1500 



1559 



1560 



718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 
' < I M II I I I I I M I I I I II I I Mill I I I II I II I M I I I I II I II I I I I I I 
GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1619 

778 AAG AT AG AA- AAAAAAAG G C AC AAAT T G T AAC AGAG AAGAAT G C AAGT GT C AAG AC AT C A 836 

I I I I I II I I Mill II I I II I I II II II I I II III I II I I I 

1620 AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 1679 

837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896 
I I I I M II III I I I II I M I II I III II I I II I I II M I II I I I I I I II I 
1680 AAC CCTTTTCTT GT AGCAGC AC AGGATT CT GAGAC AGAT TAT GT CAC AAC AGAT AAT T TA 1739 

8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I I I I I I I I I I II I II I I II I I I I II || Ill II M I I I I I I I I II 

17 40 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1799 

957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

I I I M M I I I I I I I II II I M I I II II I I I I I I I I II Mill I II 

CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1859 



1800 



1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I'll I I I II II I I II I I II I I I I I I II I I I II I II I I II I I I I I I 

AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 



1860 



1919 



1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

'Mill II I I I I I II I I I II I I II I | I II I I I I I I 

CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 



1920 
1137 



1979 
1196 



ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 

Nil Mill I I I I I I I I I II I I II I II I I I I II I I I I I I MM III 

1980 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2039 

1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

nnAn N II II II I I | | II I I I I I II I I | M II II 

2040 TCATCACCATTAGAAGC— TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2096 

1257 AAT C C C C C AC CAT AT GAG G AGG C CAT G AAT GT AT C AC T AAAAAAAGAAT C AG G AAT G AAT 1316 
11 N II I I II I I I I I I I | | | | | | M I I M M I II I I I I II I || M I I I I II I II 

2 097 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2156 



1317 GAAGAAAT CAC AGAGC CT GAAG GTAT T AGT GT AG C T GT T CAGGAAAC AGAAG C T C CT T AT 1376 

MINIM I I I I I I I M I I II I I II II I I I I I I I I I I I I I I I I I I I I 

2157 GAAGAAAT T AAAGAG C C T GAAAAT AT T AAT GC AG C T C T T CAAGAAAC AGAAG C T C C T TAT 2216 

1377 AT AT C TAT T G CAT GT GAT T T AAT T AAAGAAAC AAAG AT C T C T AC T GAAC C G AC T C CAGAT 1436 

I I M I I I I II I I I I I I I I I I I | | | I I I II I I I I I I I | Ml I | | | M I I I I I III 
2217 AT AT C TAT T GCAT GT GAT T T AAT TAAAGAAAC AAAG CTTTCTGCT GAAC C AG C T C C G GAT 227 6 

1437 T T C T C T AGTT AT T CAGAAAT AG C AGAAGT T G CACAG C C AGT G C C C GAGC AT T C T GAGC T A 14 96 

N I I I I I I I I II I I I I I I III I I I I II I II I I | I I | | | | || I I I I I I I 

2277 T T C T CT GATT AT T CAGAAAT G G CAAAAGT T GAACAGC C AGT GC C T GAT C ATT CT GAGC T A 2336 

1497 GT T GAAGAT TCCTCCCCC GAT T C T GAAC C AGT T GACT TAT T T AGT GAT GAT T C AAT AC C C 1556 

I M I I II I I I I I II II I I I I I I I I I I I I I I II I I I I I I I I I I | | | | | | M I I I I I M 

2337 GTT GAAGATT C CT CACCT GATT CT GAAC CAGTT GACTTATTTAGT GAT GATT CAATAC CT 2396 

1557 GAAGT T C C ACAAAAACAAGAT GAAG CT GTAAT AC T T GT GAAAGAAAAC C T C ACT GAAAT T 1616 

I I M I I II II I I I I I I I I I I I | | | | M || | || | | | | | | | | | | | | | | | | | | | | 

2397 GAC GT T C C ACAAAAACAAGAT GAAACT GT GAT GC T T GT GAAAGAAAGT C T C ACT GAG AC T 2456 

1617 T CAT C T GAGT CAAT GAC AGGAC AT GAC AAT AAGGGAAAAC T C AGT GC TT C AC CAT CAC CT 1676 

IN I I I M I I I I I I I I | | | | | | | | | | | | | | | | M I I I I I | | | 

2457 TCATTTGAGT CAAT GATAGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 2 513 

1677 GAG G GAGGAAAAC C GT AT T T G GAGT CT T T T C AGC C CAGT T T AGGC AT C ACAAAAGAT AC C 173 6 

MINIUM! I I MINIM II I I II I II I N I I I I I I I M II I II I I I II 

2514 GAGG GAGGAAAGC CAT AT TT G GAAT CT T T TAAGC T CAGT T TAGATAAC ACAAAAGAT AC C 2573 

1737 T T AGC AC C T GAT GAAGT T T CAGC AT T GAC C C AAAAGGAGAAAAT C C C T T T GC AG AT G GAG 179 6 

I I M I II I I I II I I I I I II I I II | || I I I II I I I II I N I I I I I II I I || I I 
2574 CT GT T AC C T GAT GAAGT TT CAAC AT T GAGCAAAAAGGAGAAAAT T C C T T T G CAGAT GGAG 2 633 

1797 GAGCT CAATAC T G C AGT TT AT T CAAGT GAT G GCT TAT T CAT T G CT C AG GAAGCAAAC C T A 1856 

MINN I I I II I II I I II I I II I INN I I I II I I II II 11111(11 I II 

2 634 GAGCT CAGT ACT GCAGTTTAT T CAAAT GAT GACTT ATTTATTT CTAAGGAAGCACAGAT A 2693 

1857 AGAGAAAGT GAAAC AT T T T CAGATT CAT CT C C GAT T GAGAT T AT AGAT GAGT T C C C GAC C 1916 

I M I I II I II I I I I || I I II II I I I I I I I I INN I II I I I N I I II I I I II II 
2694 AGAGAAACT GAAACGTTTT CAGAT T CAT CT C CAAT T GAAAT TAT AG AT GAGT T C CCT ACA 2753 

1917 T T T GT CAGT T CTAAAG CAGAT TCTTCTCC T ACAT TAG C C AG GGAAT AC ACT GAC CT AGAA 197 6 

N M I I II I II II I I II I I I I III I N I II I I I II I II I I I I II I I I I I II 

2 754 T T GAT C AGT T CTAAAACT GAT T C ATT T T C TAAAT TAG C CAG GGAAT AT ACT GAC CT AGAA 2 813 

1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

HI H I I M I II N II II N I II I N | | || I I I I 

2 814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2873 

2037 G GAT T GC C C CAT GAC CT T T CT T T C AAGAGT AT ACAAC C TAAAGAG GAAGT T CAT 2090 

I M II I II II II N II I II I I I I I I I I II || N I I I I II I I I I I 

2 874 GAAT T GC C C CAT GAC CT T T C T T T GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT CAGT 2 933 

2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I N I I I I I M M N I II I I III II I N I || I I I I I I I 

2 934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2993 



2151 
2994 



GAT GT TTCTGCTT T GGAT G CT CAAGC AGAGAT AG G CAG C AT AGAAAAAC C CAAAGT T C T T 2210 

M I I I II I I II II IN I I I I I I I I I I I I I I I MMIM I I I I I I I I I I I I I I I 

GAT GTTTCTGCTTT G GC CAC T CAAG C AGAGAT AGAGAG C AT AGT TAAAC C C AAAGT T C T T 3053 



2211 GT GAAAGAAGC C GAGAGAAAAC TTCCTTCT GAT AC AGAAAAAGAG C GAAGAT C T C CAT C T 2270 
I I I I I I I I I I I I I I I I | | | | | | | | | N I I II | | | | | | | I I I I I I I | | | | | | | | 
GT GAAAGAAGCT GAGAAAAAACTT CCTT C CGATACAGAAAAAGAGGACAGAT CAC CAT CT 3113 



3054 
2271 
3114 
2331 
3174 
2391 
3234 
2451 
3294 



GC TAT AT T T T C AGC AGAG CT GAGTAAAAC T T C AGT T GT T GAC C T C CT C T AC T G GAGAG AC 2330 

1 1 I I I I I I I I I I M I I M I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I | I I I 

G CT ATAT T T T CAG CAGAG C T GAGTAAAAC T T C AGT T GT T GAC C T C C TGT ACT GGAGAGAC 3173 

ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 
I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | || | | | | | | | | | | | | | | | | | 

ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3233 

TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 245 0 

I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I | | | | | | | f I I | | I I I M | || 

TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3293 

T T T AGGAT ATATAAG G GT GT GAT C C AGGC T AT C CAGAAAT C T GAT GAAG G C CAC C CAT T C 2510 

urn i 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 mi 

T T T AGGAT AT ACAAGGGT GT GAT C CAAGCT AT C CAGAAAT C AGAT GAAGGC CAC C CAT T C 3353 



2511 AGGGC AT ATT T G GAAT C T GAAGT T GC T AT AT CT GAGGAGT T GGT T C AGAAGT AC AGCAAT 2570 
I I I I I I I I I I I I I || | | | | | | M | | | | | | | | | | | | | | || | | | || | | | | | | || | | | in 
AGGGCAT ATCTGGAAT CT GAAGT T GCTAT AT CT GAGGAGT T GGT T CAGAAGTACAGTAAT 3413 



3354 
2571 
3414 
2631 
3474 
2691 
3534 
2751 
3594 
2811 
3654 
2871 



TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2 630 

I > M I I I M Ml I II II II I Mill I II M II I I II II I I I I I || I m I II 

TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473 
GAT T T AGT T GAT T CT C T GAAGT T T GC AGT GT T GAT GT GGGT AT TT AC C TAT GT T G GT G C C 2690 

II I I I I I I I II I I M I I M M II I I I II I I M I II I I I II I M I II I I I II I | | | M M I 

GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533 

TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2 750 
I I I I I I I I M I M M I I M M I I I || I || || I I I I III | | | M I I II I I I II I II II 
TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3593 

AT T TAT GAAC G GCAT C AGGC GCAAAT AGAT CAT TAT CT GGGAC T T GCAAATAAGAAT GT T 2810 

I I I I I M II I I I I II | | | || || | || || II I I I I I II I I II II I II I I I I I 

AT T TAT GAAC GG CAT CAG G C ACAGAT AGAT CAT TAT C TAG GAC T T G CAAAT AAGAAT GT T 3 653 

AAAGAT G C TAT G G C T AAAAT C CAAG C AAAAAT C C CT GGAT T GAAG C GT AAAG C T GAAT GA 2870 

I I I I I I I I I I I I I II II || | | || M II II II I II I MMIM 

AAAGAT GCTAT GGCT AAAAT C CAAGC AAAAAT C C C T G GAT T GAAG C G CAAAGCT GAAT GA 3713 

2930 



GAAAG C CT GAAAGAGT TAACAAT AGAGGAGT T TAT C T T T AAAGG G GAT AT T CAT T T GAT T 

„ u 1 111 I Nil I I IIIIIIIIIIMMMMMMIMM 

3714 AAAC G C C C AAAAT AAT T A GT AGGAGT T CAT C T T TAAAG GG GAT AT T CAT T T GAT T 37 68 

2 931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC 298 4 

I ' ' II II II I II II I I I II II | M I I I II I || | | | || | | || 

3769 AT AC G GGG G AG G GT CAG G GAAGAAC GAA- CCTT GAC GT T G CAGT G C AGT T T CAC AGAT C G 3827 

Qy   2985 -----AGATCTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTG 3038
               ||||||||||||||||| | ||| || | ||||||||||| ||||||||||||||
Db   3828 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3887

Qy   3039 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 3098
          || ||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db   3888 CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 3947

Qy   3099 ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 3158
          |  | |||||||| | ||||||||||||| |            | | ||| |||||||||
Db   3948 A--TCTTTTTCCTATCTGAGGCACTGGTGGA-----------ATAAAAAACCTGTATATT 3994

Qy   3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214
            ||||||| |||| ||||||||| | || ||||  | ||||| |||  ||||||||
Db   3995 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4051
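
In each block of the alignment above, the middle row marks the columns where the query (Qy) and database (Db) residues are identical. The sketch below shows how such a row can be derived from two aligned rows of equal length. It uses the Qy 421-480 / Db 1263-1322 block of this listing as input; the function name and the use of "-" as the gap character are assumptions made for illustration, not a description of the search program itself.

```python
def match_line(query: str, subject: str, gap: str = "-") -> str:
    """Return a row with '|' under identical residues and ' ' elsewhere."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    return "".join(
        "|" if q == s and q != gap else " "
        for q, s in zip(query, subject)
    )


# Rows copied from the Qy 421-480 / Db 1263-1322 block of this listing.
qy = "GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG"
db = "GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG"

line = match_line(qy, db)
print(qy)
print(line)
print(db)
print(line.count("|"), "identical columns out of", len(qy))
```

Summing the "|" counts over all blocks of one alignment should give the Matches figure in its summary line, provided the rows are transcribed without error.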
LOCUS       AF333336     3491 bp    mRNA    linear   PRI 27-MAY-2003
DEFINITION  Homo sapiens testis specific reticulon 5
            cds.
ACCESSION   AF333336
VERSION     AF333336.1  GI:13377627
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3491)
  AUTHORS   Zhou,Z.M., Sha,J.H., Li,J.M., Lin,M., Zhu,H., Zhou,Y.D., Wang,L.R.,
            Zhu,H., Wang,Y.Q. and Zhou,K.Y.
  TITLE     Expression of a novel reticulon-like gene in human testis
  JOURNAL   Reproduction 123 (2), 227-234 (2002)
  MEDLINE   21857060
   PUBMED   11866689
REFERENCE   2  (bases 1 to 3491)
  AUTHORS   Sha,J.H., Zhou,Z.M. and Li,J.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-JAN-2001) Key Lab of Reproductive Medicine, Nanjing
            Medical University, Han Zhong Road 140, Nanjing, Jiangsu 210029,
            China
FEATURES             Location/Qualifiers
     source          1..3491
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /tissue_type="testis"
                     /dev_stage="adult"
     CDS             179..3139
                     /note="reticulon"
                     /codon_start=1
                     /product="testis specific reticulon 5 protein"
                     /protein_id="AAK20831.1"
                     /db_xref="GI:13377628"
                     /translation="MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHE
                     YLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSP
                     KAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVVSSEKAK
                     DSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKC
                     FADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPL
                     LGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE
                     EVVANMPEGLTPDLVQEACESELSEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP
                     SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENP
                     PPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPD
                     FSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLT
                     ETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPL
                     QMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE
                     YTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATS
                     KVLLLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTS
                     VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ
                     AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLK
                     FAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMA
                     KIQAKIPGLKRKAE"
     polyA_signal    3402..3407
ORIGIN
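
The CDS location in the record above (179..3139) fixes the length of the encoded protein, which gives a quick check on the transcribed translation. The sketch below is a minimal illustration under stated assumptions: the location is a simple "start..end" span with no join or complement operators, the reading frame starts at the first base (codon_start=1), and the span includes the stop codon. The function name is hypothetical.

```python
def cds_protein_length(location: str) -> int:
    """Number of amino acids encoded by a simple 'start..end' CDS span."""
    start, end = (int(part) for part in location.split(".."))
    bases = end - start + 1          # both coordinates are inclusive
    if bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return bases // 3 - 1            # drop the terminal stop codon


# 179..3139 spans 2961 bases, i.e. 987 codons including the stop.
print(cds_protein_length("179..3139"))
```

If the translation has been transcribed without loss, its residue count should equal the value printed here.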

Query Match 62.1%; Score 2379.2; DB 9; Length 3491; 

Best Local Similarity 86.4%; Pred. No. 0; 

Matches 2806; Conservative 0; Mismatches 388; Indels 54; Gaps 14; 

Qy      1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
          || |||||||||||||||||| |||| ||||||| ||||||||||||| ||||  || ||
Db    272 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 331

Qy     61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120
          ||| | ||||||||||||||||||| || || |   | ||||||||||||||   |||||
Db    332 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 391

Qy    121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180
          ||||||||||||| || | |  |||| ||| |||||||||| ||||||||||| |||||
Db    392 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 451

Qy    181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240
          |||||||||| |||||| ||||||| ||||| ||| |||||| ||||||||| ||||||
Db    452 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 511

Qy    241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300
          ||||||||||| |||||| | ||||  | || |||  |||||||| | |||||||||  |
Db    512 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGACGAAGAAGAGAAGTTAGTTAGTAAT 571

Qy    301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360
          ||||||||||||| ||| || |||||| ||||||  ||||| ||||  ||| |||| ||
Db    572 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 631

Qy    361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420
          ||   |||| ||||| |||||||| |||| ||||||||||| |||||| |||||||||
Db    632 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 688

Qy    421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480
          |||||| ||||| |||||||||||||||||||||||||||||||||||||||||||||||
Db    689 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 748

Qy 4 81 AAAGAT ACT T ACAAGCAAGAT AGT GAT GT T T T GAT T G C T GGAGGT AAT AT AGAGAG CAAA 54 0 

I I I I I I I I I I I I I I I I I I M II I I I I II I I I I II I I I I I I I I I I I I I I 
Db 749 AAAGATAGT AAG GAAGAT AGT GAT AT GTTGGCTGCTG GAGGT AAAAT C GAGAG CAAC 805 



Qy 541 T T GGAAG GTAAAGT GGATAAGAAAC AC T T T T CAG AT AG C C T T GAACAAACAAAT C GT GAA 600 

I I I I I I I I I I I I I I I I I I I III Ml I I I I ! I I I I I I | I | | | | | MM III 
Db 806 T T GGAAAGTAAAGT GGATAAAAAAT GT T T T G C AGAT AG C C T T GAG CAAACT AAT CAC GAA 8 65 

Qy 6 °1 AAAG AT AGT G AAAGC AGT AAT GAT GAC ACT T CAT T T C C C AGT AC AC C AGAAG CT GT AAGA 660 

I I I I I I I I I I I II I I II I I I I I M II I I I M I I I I I I II I II I I I I I III 
Db 866 AAAGAT AGT GAGAGT AGT AAT GAT GAT ACT T C T T T C C C C AGT AC GC C AGAAG GT AT AAAG 925 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AAC T GAGAAT GT T 717 

I I IIM Mill II Mill I I II I II I I II I I I I I II II II II I I I II 

Db 92 6 GAT C GT T CAG GAG CAT AT AT CAC AT GT GCTCC CT TT AAC C C AGC AG CAAC T GAGAGC AT T 985 

Qy 718 T CAACAAAC AT TTTTCCCTTGTT GGAAGAT CAT ACT T C GGAAAAT AAGAC AGAT GAAAAA 777 

I N II I II I I II II I I II I I I I I I I II I M I I I I I II I II I I II I I I I I | M I 
Db 98 6 G CAAC AAAC AT T T T T C CT TT GT T AGGAGAT C C TACT T CAGAAAAT AAGAC C GAT GAAAAA 104 5 

Qy 77 8 AAGAT AGAA- AAAAAAAGGCACAAAT T GT AACAGAGAAGAAT GC AAGT GT CAAGACAT CA 836 

M I I II II I I I I Mill I I I I I I I II II I I I I I I I II I II II I II | M I 
Db 104 6 AAAAT AGAAGAAAAGAAGGC C CAAAT AGT AACAGAGAAGAAT ACT AGC AC CAAAACAT C A 1105 

Qy 837 AAC C CT T T C CT TAT GG CAGC AC AGGAGT CTAAGACAGAT T AC GT T ACAAC AGAT CAT GT G 896 

I I I I I I I I III I II II I I I I I II I II I I II I I I I I I || || M M I I I II I 
Db 1106 AAC CCTTTTCTT GT AG CAGC ACAGGAT T CT G AGACAGAT TAT GT C ACAAC AGAT AAT T T A 1165 

Qy 8 97 T CAAAGGT GAC CGAG GAAGT AGT GGCAAAC AT GC CT GAAGGT CTAAC C C C AGAT TT G GT T 95 6 

M I I I II I M II I I II I I I I I I I I I II I I II II I I I I I II M I I I II I II II 
Db 1166 ACAAAGGT GACTGAGGAAGT CGT GGCAAAC AT GC CTGAAGGCCT GACT C CAGATTTAGTA 1225 

Qy 957 C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGCTACT G GT AC AAAAAT T GC CT T T GAAAC A 1016 

I I I I I I M M II M I I M I I II II I I I I I I M II I I I II II I Mill I I II I I M 

Db 1226 C AGGAAG CAT GT GAAAGT GAAT T GAGT GAAGT TACT GGT ACAAAGAT T G CT T AT GAAAC A 1285 

Qy 1017 AAAAT GGAC C T GGT T C AAACT T C AGAAG C T GT GCAG GAGT C ACT T T AC C CT GTAACAC AG 1076 

I I I M II II II II I I II II M I II M I I I I I I II II II I II I I II I Mill 
Db 12 86 AAAAT GGACTT GGTTCAAACAT CAGAAGTTAT GCAAGAGT CACT CT ATCCT GCAGCACAG 1345 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I M II II I II I II I I I M M II I II II I I || M I II I I II I I I I II 

Db 134 6 C T T T GC C CAT C AT TT GAAGAGT C AGAAGCT AC T C C T T CAC CAGT T T T GC CT GAC AT T GT T 14 05 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I II II I I II I II II II I I I || | | M II I I I I II Ml 
Db 1406 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1465 

Qy 1197 T CAT CAC CAT T AGAAACT C T T C C T T CAGT TAATT AT GAAAGCATAAAGT TT GAG C C T GAA 1256 

I I I I I II II II I I II M I I I I I I I I I I M I I I II || I || M I I II II I II II 

Db 14 66 T CAT CAC CAT T AGAAG C T T C T T CAGT T AAT TAT GAAAG C AT AAAAC AT GAG C C T GAA 1522 

Qy 1257 AAT C C C C CAC CAT AT GAGGAGG C CAT GAAT GT AT CAC T AAAAAAAGAAT CAG GAAT GAAT 1316 

' I I I I I I II II II I II I I I II I II II I II I I M I II I I II I I I II II 

Db 1523 AAC C C C C CAC CAT AT GAAGAG G C CAT GAGT GT AT CACTAAAAAAAGT AT CAG GAAT AAAG 1582 

Qy 1317 GAAGAAAT CACAGAGCCT GAAG GT AT T AGT GTAG C T GT T CAG GAAACAGAAG C T C C T TAT 137 6 

I I > I I I I I I I I II M I I M I II II II I I II II II I I || I I I || I I I I 

Db 1583 GAAGAAAT T AAAGAGC CT GAAAAT AT T AAT G CAGC T CT T CAAGAAAC AGAAG C T C C T TAT 1642 



1377 AT AT C TAT T G CAT GT GAT T TAAT T AAAGAAACAAAGAT C T C T AC T GAAC C GACT C C AGAT 1436 

i M I I II I II I I I I I I I I I I II I I I I I I I I I I I I II I III M I I I I I I I I I Ml 
164 3 AT AT C TAT T GCAT GT GAT T TAAT T AAAGAAACAAAG CT T T C T G CT GAAC C AGCT C C G GAT 17 02 

1437 TT CT C T AGT T AT T CAGAAAT AGC AGAAGT T G CAC AG C C AGT G C C C GAGC AT T CT GAGC T A 14 96 

I I I I I I M I I I II I I I M I I I I I I I I I | | | || I I I I I I 

1703 TT CT CT GATTAT T CAGAAAT GGCAAAAGTT GAAC AGC C AGT GC CT GAT C ATT CT GAGCT A 1762 

1497 GT T GAAGAT TCCTCCCCC GAT T C T GAAC CAGT T GAC T TAT T T AGT GAT GAT T CAAT AC C C 1556 

I I M I I I I I I I I I I II I I I II I I I | || | | | | | | | | | | | | | || | | | | || | | | | | | || | 

1763 GT T GAAGAT T C CT CAC C T GAT T C T GAAC CAGT T GAC T TAT T T AGT GAT GAT T CAAT AC C T 1822 

1557 GAAGTT CCACAAAAACAAGAT GAAGCT GTAATACTT GT GAAAGAAAAC CT CACT GAAATT 1616 

II I I I I M I I I I I I I I I I I I M I I I II II II I I I II I I II I I I I II I I II I I 
1823 GAC GT T C C AC AAAAAC AAGAT GAAAC T GT GAT G CT T GT G AAAGAAAGT C T CAC T GAGAC T 1882 

1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCTTCAC CAT CACCT 1676 

I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I | | I I I 
18 83 T CAT T T GAGT CAAT GAT AGAAT AT GAAAATAAG GAAAAAC T CAGT GC T TT GC CACCT 1939 

1677 GAGGGAGGAAAAC C GT AT T T GGAGT CT T T T C AGCC C AGT TT AGGC AT CACAAAAGAT AC C 1736 

I I I I I I I M I I II I II I I I I I M | | | | | | | || | | | | | | | | | || || | | | | | | | 
194 0 GAGGGAGGAAAGC C AT AT T T GGAAT CT T T TAAG CT C AGT TT AGATAACAC AAAAGAT AC C 1999 

1737 T T AGC AC C T GAT GAAGT T T C AG CAT T GAC C C AAAAGGAGAAAAT C C C T T T G C AGAT G GAG 1796 

I I I I I I I I I I I I I I M I I I II I I I I II I I || I | | | | | | | | M M Ml 

2000 C T GT T AC CT GAT GAAGTT T CAACAT T GAG CAAAAAGGAGAAAATT C CT T T G C AGAT G GAG 2 059 

1797 GAGCT CAAT ACT GCAGTT TAT T CAAGT GAT G G C T TAT T CAT T GCT C AG GAAG CAAAC C T A 1856 

I I M I II I I I I I I I I II M I I I I I Mill I I I I II III II II M M II I M 

2 060 GAGC T CAGT AC T G CAGT T TAT T CAAAT GAT GAC T TAT T TAT T T C T AAGGAAG CAC AGAT A 2119 

1857 AGAGAAAGT GAAAC AT T T T C AG AT T CAT C T C C GAT T GAGAT TAT AG AT GAGT T C C C GAC C 1916 

I I I I I I I I I I I I I I M M I I M I I II M II I I I II I II II II II M M II II II 

2120 AGAGAAACT GAAAC GT T T T CAGATT CAT C T C CAAT T GAAAT TAT AGAT GAGT T C C C T AC A 2179 

1917 T T T GT CAGT T CTAAAGC AGATT CTT C T C C T AC ATT AG C CAGGGAAT ACACT GAC CTAGAA 197 6 

II M M M II II I I Mill I I Ml I I I II II I I II I I I I I II II II II II I 

218 0 T T GAT CAGT T CT AAAACT GATT CAT T T T C TAAATT AGC CAGGGAAT AT ACT GAC CTAGAA 2239 

1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

I M II II II I II II II I II I I II I II II I II II II I II I I II I II II II II 
2240 GTAT C C C ACAAAAGT GAAAT T GCTAAT G C C C C G GAT GGAG CT G GGT CAT T G C C T T GC AC A 22 9 9 

2037 G GAT T G C C C CAT GAC CTTTCTTT CAAGAGT AT ACAAC CT A AAGAGGAAGT T CAT 2 09 0 

I I M II M II I II I I I II II II I I II I M I I I I I I M I II II I I 

23 00 GAAT T G C C C CAT GAC CTTTCTTT GAAGAAC AT AC AAC C C AAAGT T GAAGAG AAAAT CAGT 235 9 

2091 GTCCCAGAT GAGT TCTCCAAAGATAGGGGT GAT GTTTCAAAGGTGCCCGTACTGCCT CCA 2150 

N I I I I I M M II I I I I II II II I I I I II I I II I I I I 

2360 TTCTCAGAT GACT TTTCTAAAAATGGGTCTGCTACATCAAAGGT GCT CTTATTGCCT CCA 2419 

2151 GAT GTTTCTGCTTTG GAT GCT CAAGC AGAGAT AGGC AG C ATAGAAAAAC C C AAAGTT C T T 2210 

I I I I I I I II I I I II M II I MM II I I I 

2420 GAT GTTTCTGCTTT G GC C ACT CAAG C AGAGAT AGAGAG CAT AGT TAAAC C C AAAGTT CT T 2479 

Qy       2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270
              ||||||||||| |||| |||||||||||| |||||||||||||||   ||||| ||||||
Db       2480 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2539

Qy       2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330
              ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db       2540 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 2599

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I ! I M I II I I I I I II I I I I I I I I I I I I I I I I | I I I | M I I I I I II II I I I I I I 
Db 2600 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 2 659 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I I I Mill I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I M I I 
Db 2 660 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 2719 

Qy 2451 TTT AGGATATATAAGGGT GT GAT C CAGGCTAT C CAGAAAT CT GAT GAAGGC CACC CATT C 2510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 272 0 TTTAGGATATACAAGGGT GT GAT CCAAGC TAT C CAGAAAT CAGAT GAAGGC CACC CATT C 27 79 

Qy 2511 AGG G CAT AT T T G GAAT CT GAAGT T GC T ATAT C T GAG GAGT T G GT T CAGAAGT ACAGC AAT 257 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I | | | | | | | | || | | | | | M | | | | Ml 
Db 2780 AGGG C AT AT C T G GAAT CT GAAGT T GC T ATAT CT GAG GAGT T G GT T C AGAAGT ACAGT AAT 2839 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I'll I I I I II I I I I II I I I I I II I I I I I I I I I I I II I I I I I II I I I II I I I 

Db 2840 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 2899 

Qy 2631 GATTTAGTTGATTCTCT GAAGT TTGCAGTGTT GAT GTGGGTATTTACCTATGTTGGTGCC 2690 

M I I I I I I I I I I M I I I I I I I I I I I I I I I I I | | | | | || | | | | | || | | | | | | | | || M | || 
Db 2900 GATTTAGTTGATTCTCT GAAGT TTGCAGTGTT GAT GTGGGTATTTACCTATGTTGGTGCC 2959 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 27 50 

I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | || | M I M I I I 

Db 2960 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3019 

Qy 2751 AT T TAT GAAC GG C AT CAGGC GCAAAT AGAT CAT TAT CT GGGAC T T GCAAAT AAGAAT GT T 2810 

I I I I I I II I I I I I I I I I I I I II I II I I I I I I I I || | | | | || M M I I I I I I I I I I II 
Db 3020 AT T TAT GAAC G G CAT CAGGC AC AGAT AGAT CAT TAT C T AGGAC T T GCAAAT AAGAAT GT T 3079 

Qy 2811 AAAGAT G C TAT G G C T AAAAT C C AAG C AAAAAT C C C T G GAT T GAAG C GT AAAG C T GAAT G A 2 870 

N I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I | | | ! | | | | | | | | | M I I I I M II I I I I 
Db 3080 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3139

Qy 2871 GAAAGC CT GAAAGAGT TAACAAT AGAG GAGT TT AT CT T T AAAG GG GAT ATT CATT T GAT T 2 930 

Mill II I I II 1 | || | | || | | | || || || | | M M I I I I I M I II I I 

Db 3140 AAACGCCCAAAATAATTA-----GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3194

Qy 2931 C CATT GGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTT CAC 2 984 

I M I I II II I I I M II I II I I M II II I II I I I II I II M II I I | || 
Db 3195 AT AC GG GG GAG G GT CAGGGAAGAACGAA- C C T T GAC GT T GC AGT G CAGT T T CACAGAT C G 3253 

Qy 2985 AGAT C T T T AT TT T T AGC AACGCAGT G - T CT GAGGAAAAAT GAC C T GT CT T GACT G 3038 

I M I I II I II I I M II I III I I I I I I I I | II II II II 

Db 3254 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3313 

Qy 3039 CCCTGTGTT CAT CAT CT T AAGT AT T GT AAG CT G CT AT GT AT GGAT T TAAAT CGTAAT CAT 3098 

M M M M II II I I I II II I || I II II I I II II || I I M I II I I II I I I II I II II I I 



Db 3314 C CAT GT GT T CAT CAT C T TAAGT AT T GT AAG CT GC T AT GT AT GGATT T AAAC C GT AAT CAT 3373 

Qy 309 9 AT T T GT T T T T C C T GT AT GAGGC ACT G GT GAATAAAC AAAGAT C T GAGAAAG C T GT AT AT T 3158 

I I M II I II I I | | | | | M | | | | | | | | | Ml mi, | 

Db 3374 A TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC C T GT AAAT T 3420 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGGAA-TTGCAAAGAAAGTGGAGCTGAC 3217 

■Mhll | | I I I I I I II I I I I II I I | | M I I I I I I I I MINI | | 
Db 3421 TTACTTTGTTGCAGATAGTCTTGCCGCATCTGGGCAAGTTGCAAAAATGGTGGAGTTAAA 34 8 0 

Qy 3218 AGAAATAA 3225 

I I I I I I 
Db 3481 AAAAAAAA 3488 
LOCUS       AB040462                4166 bp    mRNA    linear
DEFINITION  Homo sapiens mRNA for RTN-xL, complete cds.
ACCESSION   AB040462
VERSION     AB040462.1  GI:11610574
KEYWORDS    reticulon.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Tagami,S., Eguchi,Y., Kinoshita,M., Takeda,M. and Tsujimoto,Y.
  TITLE     A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on
            endoplasmic reticulum and reduces their anti-apoptotic activity
  JOURNAL   Oncogene 19 (50), 5736-5746 (2000)
  MEDLINE   21010696
   PUBMED   11126360
REFERENCE   2  (bases 1 to 4166)
  AUTHORS   Eguchi,Y., Tagami,S. and Tsujimoto,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-2000) Yutaka Eguchi, Osaka University Graduate
            School of Medicine, Biomedical Research Center, Department of
            Medical Genetics; Yamadaoka 2-2, Suita, Osaka 567-0871, Japan
            (E-mail:eguchi@gene.med.osaka-u.ac.jp, Tel:+81-6-6879-3363,
            Fax:+81-6-6879-3369)

FEATURES             Location/Qualifiers
     source          1..4166
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /tissue_type="brain"
                     /tissue_lib="human fetal brain"
     gene            1..4166
                     /gene="RTN-x"
     CDS             245..3823
                     /gene="RTN-x"
                     /note="reticulon family"
                     /codon_start=1
                     /product="RTN-xL"
                     /protein_id="BAB18927.1"
                     /db_xref="GI:11610575"



                     /translation="MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED
                     EDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAP
                     ERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWT
                     PPAPAPAAPPSTPAAPKRRGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTIS
                     AGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV
                     SEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKL
                     VSNNILHNQQELPTALTKLVKEDEVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFER
                     VWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPST
                     PEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEK
                     NTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEV
                     TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV
                     PSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPEN
                     INAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSP
                     DSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKP
                     YLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET
                     ETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE
                     LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALATQAEIESIVKPKV
                     LVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLS
                     LTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELV
                     QKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI
                     SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE"

ORIGIN 



Query Match 61.9%; Score 2372.6; DB 9; Length 4166; 

Best Local Similarity 86.2%; Pred. No. 0; 

Matches 2776; Conservative 0; Mismatches 399; Indels 44; Gaps 



Qy          1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
              || |||||||||||||||||| |||| ||||||| ||||||||||||| ||||  || ||
Db        956 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 1015

Qy         61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120
              ||| | ||||||||||||||||||| || || |   | ||||||||||||||   |||||
Db       1016 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 1075

Qy        121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180
              ||||||||||||| || | |  |||| ||| |||||||||| ||||||||||| |||||
Db       1076 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1135

Qy        181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240
              |||||||||| |||||| ||||||| ||||| ||| |||||| ||||||||| ||||||
Db       1136 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1195

Qy        241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300
              ||||||||||| |||||| | ||||  | || |||| |||||||| | |||||||||  |
Db       1196 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1255

Qy        301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360
              ||||||||||||| ||| || |||||| ||||||  ||||| ||||  ||| |||| ||
Db       1256 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1315

Qy        361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420
              ||   |||| ||||| |||||||| |||| ||||||||||| |||||| |||||||||
Db       1316 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1372

Qy        421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480
              |||||| ||||| |||||||||||||||||||||||||||||||||||||||||||||||
Db       1373 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1432

481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 
'l 11111 1 Ml MIIIIIMII I Ml IMIIIIIMII II I || 



1433 AAAGATAGT AAGGAAGATAGT GAT AT GTT GGCT GCTGGAGGTAAAAT C GAGAG 



541 
1490 

601 
1550 

661 
1610 

718 
1670 



CAAC 14 89 
600 



TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 
1 1 I 1 I I I N I I I N I I I I I III Ml I I I I II | | | | | | | | | | | | | | | | in 

TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 1549 

AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 
1 I 1 1 1 I I I I I I I I I I I I I I II I I I Mill I I MINIM I I I II M I III 

AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1609 

GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 7 1 7 

1 I UK I I I I I II I I I I I I I I M II I II I II I II I I || I II I II I || 
GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1669 

TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 7 7 7 

I 1 1 1 I I I I I I I I M II I II II I I I II I MUM II I I M I II II I I I I II I II 

GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1729 

778 AAGATAGAA- AAAAAAAGGC AC AAAT T GT AAC AGAGAAGAAT GC AAGT GT CAAGAC AT CA 836 
„ I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | ,, | | | | I M I I II I II 

AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 178 9 

AAC C CT T T C C TT AT GGC AGCAC AGGAGT CT AAGACAGATT AC GT T ACAACAGAT CAT GT G 896 

1 I 1 1 1 I I I HI I N I I I II I II I Ml II II I I I I II II II II II I II II I 

AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 184 9 
TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

i II M II , i | II I II I I M II I I II II I II I II I || M II II I I II 

ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1909 

CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

Kim I I N II II I II I I I || M II M I II M II II I I II II I I I I II M 

CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1969 

AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 
HI I I M | || | | | I I I I I II II I II II I II | | | 

AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 2029 

CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 
" I I 1 1 I I 1 I I H I I I II I M M II II I II II Mill II II II II II I II II I I 
CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 2089 

ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 119 6 

1 1 I I > I I I I I I I I I I I I M M I II I I II I | || || || I I | M I I Ml 

ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2149 



1730 
837 
1790 
897 
1850 
957 
1910 
1017 
1970 
1077 
2030 
1137 
2090 
1197 



TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

I I I M II I M II I II I I || || I I I II I II II II II I || M II I I I I I I II II 

2150 TCATCACCATTAGAAGC— TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2206 
1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

„ nn 11 MM IMIMIMI Mill Mill II MIIIMM II 

2207 AAC C C C C C AC CAT AT GAAG AG G C CAT G AGT GT AT C ACT AAAAAAAG TAT C AG G AAT AAAG 2266 



1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

I I IINII I I I I I I I M I I Mill II I I I I Mil III Ml MMIM Mill 
2267 GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 2326 



1377 
2327 



ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 
M II I II II II I I | | | | | | | | | | | | | | | | | | | | in | | | | | | | IMI M | 

ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2386 



1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 1496 

''I'M I I I I I II I II I I III N II I I I I II II II II I I I I | | | | in 

TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 2446 



2387 
1497 
2447 
1557 
2507 
1617 



GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

1 1 1 1 I I I M M II I II II I I I I I Ml I I II | | | | | | || I II I I I I I 

GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2506 

GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

1 I I M II I I I I I II I II I M I I I MM II I I I II II I II II I I | | | 

GAC GT T C C ACAAAAACAAGAT GAAACT GT GAT GCT T GT GAAAGAAAGT C T CACT GAGAC T 2566 



TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

1111 MM I II I III II I MM MMIM Mill I I I I | Mill 

2567 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 2623 

1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

M I I I I M I M II I I I I II I I I I I I I | M | M | II I I I I | | | | | | | M II I I 

GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2 683 
TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

1 M M I I M II I I I M I I I I I II I II II I I I II II I I II I I I II II 

CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2743 



2624 
1737 
2684 



1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

1 I M M I I M I I M II I II I II I II M I II II I M I II I II 

GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2 803 



2744 
1857 



AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

oori , 1111111 I I I I I I MINI Mill I I MM I I I f I I I I I I I I I 

2804 AG AG AAAC T G AAAC G T T T T C AG AT T CAT C T C C AAT T G AAAT TAT AG AT G AGT TCCCTACA 2863 



1917 
2864 
1977 
2924 



TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II M M M II I I I I I | | II I I III I I I I I I I I I I I I M I I II I II II I I I I 
TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2 923 

GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I M M II I M II I I I I I I I I I II I II M II I I I II I II II | | MM II 

GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2983 



2037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2090 

1 M I I II M I I I II I I I II I II I I M II II II I I I II I II II I I 

GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 3043 



2984 
2091 
3044 



GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

' ' ' I M II I II II Ml II II II I MM Ml III I II MMIM I 

TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 3103 



2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

I I I I I I I I I | I I I I I | I I I I I I I I I I I I I | | | | | | | | | | | I I I | | | M | | | | | 
3104 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 3163 

2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

I I I I I I I I II I I I I I II I I I I I I I I II I I I I I I I I | | | | | M Mill I I II I I 
3164 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 3223 

2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I N I I I I I I I I I I I I M I I I I I I I I I I I I I | | I I I I M I I I I I I I I | | | | | | | | 

3224 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3283 

2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I | | I || Mil'!,, 

3284 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 334 3 

2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

1 1 ' 1 1 1 1 1 1 1 ; i 1 1 1 ii 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

3344 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3403 

2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II Ill II | | | | | | | | | | | | | | || 

34 04 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3463 

2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

minimi ii 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 M i M ii in 

3464 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3523 

2 571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I MMI I I I I I I II I I I I I II I I I I I I I I I I I I I I M 

3524 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3583 

2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

M I M I I I II II I I II I I II I I II II I I II II II II II I II I I II I II I I II I II II I II 
3584 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3643 

2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

H I I I I I I I I II I M I I II I II I I II I I I I I I I I I I I I I I I I I I I I 

3644 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3703 

2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

N N I I I II I I II I I I I I | | | | I I I I I I I I I I II I I II I I I II II I I I I I II I I I I I 
37 04 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3763 

2 811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2870 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I MM Ml 

3764 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3823 

2871 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 2930 

1 I I 1 1 I I I I I I I II I I I I I I I I I I I I II I II I I I I I I I I I II I I I I 

3824 AAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3878 

2931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC 2 984 

I II N I I I I I M I M I I M I I II I I I I I I I I I I I I II II II II II II 
3879 ATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3937 

2985 AGATCTTTATTTTTAGCAACGCAGTGT-CTGAGGAAAAATGACCTGTCTTGACTG 3038 



Db 


3938 


Ov 


3039 


Db 


3998 


Qy 


3099 


Db 


4058 


Qy 


3159 


Db 


4114 



I I I I I I I I I I I I I I II I I III III 1 1 1 1 1 1 1 1 II I 1 1 1 II 1 1 1 II 1 1 1 1 

TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 



3997 



C C CTGT GTT CAT CAT CTTAAGTAT T GTAAGCT GCTAT GTAT GGAT TTAAAT C GTAAT CAT 3098 

11 ' ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | I I I I I I I II I I I I I I I I | I I I 

C C AT GT GT T CAT CAT C T T AAGT AT T GT AAG CT G C TAT GTAT GGAT T TAAAC C GTAAT CAT 4 057 

AT TTGTTTTTCCT GTAT GAG GC AC T GGT GAATAAAC AAAGAT C T GAGAAAG C T GTAT AT T 3158 

I 1 1 1 1 1 1 I I Ml | || | | | | | || 

ATCTTTTT C CT AT C T AAT C TAT C T GAGGCAC T G GT GGAATAAAAAAC C T GTAT AT T 4113 



I I I I I I I II I I I I I I I I I I I I II I || || | 
TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGT 



4152 
     source          1..4093
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"



FT 
FT 



Query Match 61.8%; Score 2370.4; DB 6; Length 4093;

Best Local Similarity 86.4%; Pred. No. 0;

Matches 2807; Conservative 0; Mismatches 386; Indels 55; Gaps 15; 
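
The summary figures above appear to be related by simple arithmetic: the "Best Local Similarity" value matches the number of identical positions divided by the total number of aligned columns (matches + mismatches + indels). This is an inference from the numbers printed in this listing, not a documented definition of the search program's output, and the function name below is hypothetical. A minimal Python sketch of that check:

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are identical.

    Assumption: the denominator is matches + mismatches + indels, which
    reproduces the percentages printed in this listing.
    """
    aligned_columns = matches + mismatches + indels
    if aligned_columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / aligned_columns


# Figures taken from the summary line directly above this note.
pct = best_local_similarity(matches=2807, mismatches=386, indels=55)
print(f"Best Local Similarity {pct:.1f}%")
```

The same relation holds for the other summaries in this listing (for example 2519 matches, 339 mismatches and 22 indels give 87.5%), which supports the assumed denominator.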

1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 
QCO 11 I N I II I I I | | | | | | | | | MM M I I II I I I I I I I | | | | M | | || | M II 

863 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 922 

61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 
n 111 1 N M M I I II II M I I I II II II | | | M I II I I M II II I II I I 

923 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 982 



121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

!!!!!!!! iJJ 1 11 1 1 1111 hi ci mi mi i i 



983 GAGAAGGC-AAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATA( 



1 8 1 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGi 



C 1041 
CG 240 



■■■■MINI Mill MMIM M I J I I I I MINIMI MINI 

TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1101 



1042 

* — — ~ * ■» ~' j- ■■- j. j. , -^'wirt.u<^«.ij/v\'l'UX(jUUGTAATAGTj 
MTCf!TAf;RRarf:aEaTSCTTr!nejifr-7,/'m»/> I ^.m. - 

?CTT 300 



2 4 1 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTC 



1102 



I I I I I I I I I | | MM I II M I I II I I I 



I I I I I I I I 



AAT C CT AGGGAAGAAATAAT C GT GAAAAAT AAAGAT GAAG AAGAGAAGT T AGTT AGT AA' 



.T 1161 



301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 



1 1 I I I I I I II I I I I I I | | | M I I I I I I M I I II I I II I I I 



I I I 



1 1 62 aacatccttcataatcaacaagagttacctacagctcttactaaattggttaaagAggat 
361 gatagagttctgtctccagaaaaaacaaaggacagttttaaggaaaagggagttgcagca 



1221 
420 



U lliLlillLiiiiiiiL 11 1 1 II Nil Ml I I MUM Ml IN II 

1278 



1222 ga — agttgtgtcttcagaaaaagcaaaagacagttttaatgaaaagagagttgcagtg 



421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 80 

10 ™ 111111 11111 I 1 1 " I I I N I I II II M I M II M II I I II I I II I I I II I I I I I II 

1279 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1338 
4 8 1 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 5 4 0 



! 1 1 11 1 1 1 HI I M I I M I I M I III II II M I II I I I M II I I I I I I 

.C 1395 



1339 AAAGATAGT- - -AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAi 



541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 
111111 I I I M I M II I II III | M II II I II II I I I I Mill MM II 



1396 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 



1455 



601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

, Allr 1 1 1 1 1 1 I 1 11 I 'I I 'I I II II I I I M I II II II I I II I I MMIM I I II 

1456 AAAGATAGT GAGAGT AGTAAT GAT GAT AC T T CT T T C C CCAGTAC GC C AGAAG GTAT AAAG 1515 
661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

icn<r 1 1 11,1 "Ml II Mill I I M M I M II II II I II II M II II I II 

1516 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1575 

718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 
Uli™iiiJiJiLi!_ 1 11111 ''''I' I I I I M II II I I II I II I I I 



1576 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 



778 ^GATAG^-AAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATCA 836 

i«« ILUI" 1 1111 11111 1,111 NIMIIIMIIIM I II Ml Mill 

1636 AAAATAGAAGAAAAGAAG GC C C AAAT AGTAAC AG AGAAGAAT AC TAG C AC C AAAAC AT C 



TCA 1695 

837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896 

1755 



NMI III I I I || I I II II I M 



1696 aacccttttcttgtagcagcacaggattctgagacagattatgtcAcaacagAtaattta 



897 



tcaaaggtgaccgaggaagtagtggcaaacatgcctgaaggtctaaccccagatti 



1876 



17 „ 1111 111 11111111 NIIIIMIMIIIIM GATTTGGTT 956 

1756 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1815 

957 caggaagcatgtgaaagtgaattgaatgaagctactggtacaaa^ 1016 

, a ,r M M 1 1 1 11 1 I I I I I I I II I M I I II I I I I I II I I I I I I I II I Mill I I I I I I I I 

1816 caggaagcatgtg^ 1875 

1017 ^TGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 
Ji ' ' 1 I I II I I II I I I I I I M I I I I | I II | | II I || M I I I I II II 

AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1935 

1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

1 1 1 1 11 1 1 1 " I I I I I I II II I I I I M I I I I II I I I I I I I I I I I I I I I I I 

1936 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGA^ 1995 

1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

1 11 11 1 ' I I I II I I I I I I I II I II I II I II I I II I I I I I I I I I I I I I I M i 
1996 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2055 

1197 ^^yy^^yyAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

Li! 11 1 1 1 1 11 11 1 1 11 I I I I I I M I I I I I I I I II I I II I I I I I I I I I I I II 

2056 TCATCACCATTAGAAGCTT ^TTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA. 2112 

1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

UN I Mill II II I I I I M I II I I I ! I 1 1 i I I ! | | i | M II Ml II II II 



2113 aaccccccaccatatgaagaggccatgagtgtatcactaaaaaaagtatcAggaAtaaag 



2172 



2173 



2233 
1437 



1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

ill 11 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 II II II 1 1 1 1 II 1 1 II 1 1 1 II II II 1 1 1 II 

GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 2232 

1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACC 1436 
INN II I I I II I I I II I || I I || I II II I I I || I I I ill II II I I I II I I III 

ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2292 
TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 1496 

111111 N I I I II I II M III I | | | I I II I II II I II II I I I I l I I i i i i i i i 

2293 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCACT 2352 
GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

11 N M I I I I I I II II II I I || || II I I II II I II I I II II II I II I I II II I I II I 

GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2412 
GAACTTCCACAAAAACAAGA 

11 1 11 11 'I M I II M | | | m M I II I II || | || || II II I I I I II M I I I l 

GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2472 



1497 
2353 
1557 
2413 
1617 



TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACC 



T 1676 



1111 1 1 I I I I I I I I I II I Nil I I I I I I I || | | II | I II i II i i i i i i i 
2473 TCATTTGAGTCAATGATAGAATATGAAAATAAG^ CACCT 2529 

1677 GAGGGAGGAAAAC^^ 

„ c ™ I I M I I II I I I II II I I I I I I I | I I I IN I I I | I I I I I I M II I I II I I I I 

2530 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCACT 2589 

1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

ocn „ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 linn | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 . . 

2590 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGA^ 2649 

1797 GAGCTCAATACTGCAGTTTATTCAAGT 185g 
I I M I I I M I I I I I I I I I I I I II I I I I l l i i i i i i iii i> 

2650 
1857 
2710 
1917 
2770 
1977 



■''Ml, 1 I I I I M I I I I I I I M | | inn MINI Ml II MIIMII I II 

GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 270 9 



111,111 111111 I I I I I M M I I I I I II I MMI I I I I | || I II I M II I I II 
AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2769 

TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGAC 1976 

1 1 I I I I I I M I I I I II I M I I I I I | M I I I II II M I II Mill ll 

TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2829 

M? 3 ???t?1^?T?t^TT??T GACATCCAGGATGGAGCTGGGTCATTGGCTTGT GCA 2036 
1,1 1 11 1 1 I I II II I I I I I M I I II I II I I I I II I I M I I I I I I I I I I i i 

2830 gtatcccacaaaagtgaaattgctaatgccccggatggIgctgggtcatt^ 2889 

2037 
2890 
2091 
2950 
2151 
3010 
2211 
3070 



GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTA AAGAGGAAGTTCAT 2090 

I MM I I I I I I M I I II I I I I I MM MIIMII I Mill II I I 

GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2949 

GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 
' ' I N I I I I II II III II || || | I | I I I I II M I II I II II I II 

TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA. 3009 

^^"T" ^T'T' T" ^^"T'T' *y GC T caagc agagat aggcagcat agaaaaac ccaaagtt c T T 2210 
11 1 I M I I M I I II II II II I II II II || I I I M II I I I M I II M II I II I I 

GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 3069 

?y?t^?^f? C ? A ?f ^^^^^T^CAGAAAAAGAGCGAAGATCTCCATCT 2270 
1,111111111 I N I II I II II II I II II || | M I I I I M II Mill MIMI 

GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 3129 
2271 ^y^y^yyy^^^^^^'-'TGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I II M I M II M I II I I M I M II I I I | I I II II I I II I II I 

GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3189 



3130 
2331 



ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 
, lnft ' ' ' 1 ' ' 1 ' ' 1 1 11 1 ' 1 11 1 11 1 1 ' I " I I II I I I I I M I I I I I I I I I MIIMII 
3190 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3249 

2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 
il'J Ml I I I I I I I M II I I || M I I II I M II I I II I I II I I | | || I I 

C 3309 



3250 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAG 



2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 
1 11,1 '''I' M I II I I | || I I || II I II II II I II II I 



3310 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3369 
2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 



I I I I I M I I I II I I I I I | | | | | | | | | | | | | 



I I 



3370 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3429 
2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I I I I I I M I I I I II I I || I Mill I II I I I I I I I II II II I I I I I I I I II 

3430 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3489 
2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

^ on M N N II I II II II I I I II I II I I II II I II I I I i || i i i i i , i , i i i i i i , , 

3490 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGT 



GCC 354 9 

2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 

"ILL! II!!!!!! 1 11 1 1 1 1 1 minim m n i n i n 1 1 1 n 1 1 n 1 1 n i 



2750 

qRRn ' ' 1 1 1 11111 M I I I I I M I M I I I I II II I I I | N I I I II I I II I I II I II 

3550 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3609 



2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

M M I N I II I I I I I I I N I II I I II II II I I II || II II I I I I I II I II II I M I l 

3610 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3669 
2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAi 



GCTGAATGA 2870 

M M II I II I II II I I I I II I I I I I II I I II I II | || | I II II I I I I | | I I I I I I I I I 

TGA 3729 



3670 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAA' 



2 8 71 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 2930 

„, n ' ' 1 1 Ml II I II II II I M I II I I II I II || || II I 

3730 AAAC GCC C AAAATAAT TA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3784 

2931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC 2984 

0 , oc 1 I M I II I I II II II I II I I I II I II II I I I I I I II I I Nil 

37 85 ATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3843 

2985 AGATCTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTG 3038 

l!li!!!!i!!!!!J !!J J 1 1 " 1 mini inn niininnin 

GACTG 3903 



3844 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTT 



3039 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 3098 

M M I I II II I II II II I I II II II I I N II I I I I II I II II I II I II I I I II II II I 
3904 CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 



3963 



3099 ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 3158 

1 I I I M I II I I I I N II II II I II I | | || | | | | , 

3964 A-TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATT 



4010 



3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCTGAC 3217 



'MINI INI II II I II II I II INI I || | | | III | I | I I I I I 
4011 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGA 4070 

3218 AGAAATAA 3225 

I III II 
4071 AAAAAAAA 4078 
BD249446
Protein similar to neuroendocrine-specific protein, and CC
encoding cDNA 
source 1..3579

/organism='Homo sapiens (human)'. 
Location/Qualifiers 
1..3579

/organism="Homo sapiens" 
PN
PD 
PF 
PR 



9916898.1 PI 



CC 

FH 
FT 
FT 



Query Match 58.0%; Score 2223.6; DB 6; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
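
The summary figures in this header appear to be related by simple arithmetic. The sketch below is an illustration only. It assumes that Best Local Similarity is the number of matches divided by the total number of aligned columns (matches + conservative + mismatches + indels), and that Query Match is the score divided by the score of the query aligned to itself (3833 for this query, the score of the 100% hit to AAD01174). These definitions are inferred from the numbers in this report, not taken from the search program's documentation, and the function names are hypothetical.

```python
def best_local_similarity(matches, mismatches, indels, conservative=0):
    """Percent identity over all aligned columns (assumed definition)."""
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


def query_match(score, query_self_score):
    """Score as a percentage of the query's self-alignment score (assumed)."""
    return 100.0 * score / query_self_score


# Figures from the header above: Matches 2519; Mismatches 339; Indels 22.
similarity = round(best_local_similarity(2519, 339, 22), 1)
# Score 2223.6 against an assumed perfect score of 3833.
match_pct = round(query_match(2223.6, 3833), 1)

print(similarity)  # 87.5
print(match_pct)   # 58.0
```

Under these assumptions both values agree with the reported 87.5% and 58.0%.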



Qy   1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
       || |||||||||||||||||| |||| ||||||| ||||||||||||| ||||  || ||
Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771

Qy  61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120
       ||| | ||||||||||||||||||| || || |   | ||||||||||||||   |||||
Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180
       ||||||||||||| || | |  |||| ||| |||||||||| ||||||||||| |||||
Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891


Qy 


181 


TCAG^TGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGC^ 
1 1 1 1 1 1 M M 1 1 1 I | | 1 | 1 | 1 | | Mill III ii i i i i i . > . ,,,,,, 


240 



892 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 



241 



AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCT1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I, i 7m rf,7u7?r?^M u ; ;?Tr r T 300 

952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGT 1011 
301 ^CATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAAT^^ 360 

1mo I I M I I M I I I I I I I I || I | | | M I I II I | | | | I I Mil III I I I i II 

1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

361 GATAGAGTT CTGT CTC CAGAAAAAACAAAGGACAGT T TTAAGGAAAAGGGAGTT GCAGCA 420 
" 1111 11111 I I I I I I I I I I I | | | || | | || I | | 11,111 I I | I | | | | i 
1072 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGtIgc^ 1128 

421 ^CTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAG 480 
110Q J.!"" 11111 IIIIIIIIIMI| IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMI 
1129 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTaIgGGAAG 1188 

481 ^GATACyACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540 

HI 1111111 1 111 MINIMI III II I I MINI 

AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 1245 



1189 AAAGATAGT 

TCGTGAA 600 



541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATC 



1111 M II II I I || I M III Ml II M I II I I II II III I in 

1246 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAG^ 1305 
601 ^ATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAG 660 

11111111111 11 ' ' I I I I I I I | J IMM II II I I || || II M I I I I II I 

AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1365 



661 



1 ' 1 I_J '1 1 ' 1 1 1 1 1 1 inn if Mmi'il'miiii i in 

:CAGAAGGTATAA 

GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 
' ' ' I I I I I II I I II I I I I II M II I I II M I II || II I I I II II 
1366 GATCGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1425 
718 TCAACAAACAT^ 

1!'' 11,1 11111 1 "III MIMI II Ml I III I I MM MM I 



1426 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACT^ 
7 7 8 AAGAT AGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGAi 



1485 



11 111111 1 1 ii ii i ii 1 1 M i mi i 7i7i7m7i"m 7?F7?T?m 836 

I486 AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 1545 

837 ^CCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896 
1Cjlc I I II II I I III | || II I II I I M III II M II I II I II || | || | | || II I 
1546 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 
ncnr 1 1 N I I II II II II I II I II I II I I II I II II II I I II II II I II M I I I II 
1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCC^ 1016 

^rr ' I M M I II II II I II I II I II II II | || | | M I II Mill I I I I II I I 

1666 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTA^ 1725 
1017 ^TGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

111111111 M II M I I II II II II I I M II I II I II II || I M I I I I I I I 

1726 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 



1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

, nac 1 1 1 1 1 1 I I I I I I I I I I I I I || I I | | | | | | M I I I I I I II I I I I I I I I M I I I I I 

1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

1ft „ c ' I N I I II I I I I I I I I I I I I I I I I I I M I I I II II I I I II I I I I I I I I ill 

1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCA.GC 1905 

1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 125 6 

1Qnc N M M I I I I I I I I I M I I I I I I I | | | | M I I I I I I I | | || | | | | | | 

1906 T CAT C AC CAT T AGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCT 



1257 
1963 
1317 
2023 
1377 
2083 
1437 



GAA 1962 

AAT C C C C C AC CAT AT GAG GAG G C CAT GAAT GT AT C ACT AAAAAAAG AAT C AG G AAT G AAT 1316 

I 1 1 1 1 ' 1 1 • I t I II I I ! I I ! I I I I I I f I I i I I f I I f | | | | [ | | IIMMIII N 

AAC C CCC CAC CAT AT GAAGAGG C CAT GAGT GT AT C ACTAAAAAAAGT AT C AGGAATAAAG 2022 
GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

IN I I I I I I I M I I I I I I I I II I I II I II I I II I II II I I I I I II I I I I I I I 

GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 2082 
ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

1 1 1 1 1 1 1 1 I M I I I M I I II I II II I II I I || | M I I Ml I I I II I I | | || in 

ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 



TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 1496 

01/10 hini 1 1 1 1 1 1 1 1 1 1 1 1 mi 1 1 1 1 1 1 1 1 ii 1 1 1 1 M 1 1 ii 1 1 1 1 1 1 ii ii 1 1 

2143 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA. 2202 
GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

I 1 1 1 1 1 1 1 1 1 1 I I I H I I I M II I I II I II I I I II I I I II I I II I I I | | || | | | || I 

GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2262 
GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

I I MIIMMIMI I I I I I I I I II I I I I I I I I I I M I I | | I I I I I I I 

GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2322 



1497 
2203 
1557 
2263 
1617 



TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 
oo _ N I I I I I I I I I I I I I II I I I I I I I I I | M I I II I I I I I II | | | | Mill 
2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 2379 

GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

1 1 1 1 1 1 1 1 1 I ' H INN || M | I II I I I I I | I II || || | | || | | 

GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2 439 

TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

I I ' I I J I I I I I I I I I I I I I I | | | | II I II I I || | | || || || M I II I I II I I 

CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2499 

GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

11 1,1 1 1 I I Mill MM I I Ml I), I II 

GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2559 



1677 
2380 
1737 
2440 
1797 
2500 
1857 



AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

ii-iiiil^iilLliiiiiiilii 1 ! 1 1 1 I I I I II I | 

™™ . _ CCCTACA 2 619 



2560 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTT 



1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

11 I ' I I I I I I I | | | IMM I I I I | | | | | | | I I II I || I I i i > i I i i i i i i i 

2620 TTGATCAGTTCTAAAACTGATTCATTTTCTAAAT^ 2679 
1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTG 1 



„ an ■" M m m i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 , 1 1 m [mm m irriif G ^ 2036 

2680 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2739 



2037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC- 



-TAAAGAGGAAGTTCAT 2090 

i i i i i i i I M I I I II I II I I I I I II I I II I I I I I 

2740 



' I I I I I I I I I I I I I I I I I I I I I MM i i I I II I I I I I l 

GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 



2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

1 I I I I I I I I 'I N Ml M II || | | || II II I II I II I I I I I II I 
2800 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA. 2859 

2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

, 0£ „ N I I M II I I I I II II I II II I I I II I II I I II II II I || | | | | M II I I I I I 

2860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2919 

2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 
I I I II II II II II II M I I I I I I I I I I M i i i i 

2920 



-II I I I I I I I II II II II II II II I I II II I II I I | | M II Mill MUM 
GT GAAAGAAGCT GAGAAAAAACTTC CTT C CGATACAGAAAAAGAGGACAGAT CAC CAT CT 2979 



2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAi 



OQOn 1 1 1 I 1 1 I I I I I I I M I I II I II II II I I II II I I M I I II II II II I I II II M II I II 

2980 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3039 



2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 
3040 i!I!Iii!i-Ul^!iili.i.illli!l M ' ' ' ' ' I I I I I I I I I I I M II | 



ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

.GC 2450 



2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAI 



3100 



I I 1 I INN N N II II II M II II I | || M I I II II I I I 



I I 



TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAG 



C 3159 



2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCAT' 



1 11 1 11 1 1 11 1 N II I I II I M I I I I II II II I I Mill TmmT II mTm TTl M 2510 
3160 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3219 



2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 



1 1 I N II II I II II I I I II || || || M M II II I || II II II II II I M I I II II III 
3220 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAA 3279 



2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

" M I N I II I II II II I M II I II Mill II II I II I I II | || | | II || | | i i | i 

3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

N I N I I I II II II I II II II II I || I I II I II II | || | | || || II II II II II II I II I 
3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGtAtTTACCTATGTTGGTGCC 3399 

2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I N I I I I | | || || || | II | M I II II I I I II II M II II I I l I l l 

3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGT 



2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATG' 



GTTCCTGTT 3459 
TT 2810 



Db 34g0 ilillli.ilii.iilliiliiiiJ J I M I I I I I M I I I I I I I I I I I I I I | | | | 



ATTTATGAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3519 

AATGA 287 0 



Qy 2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTG. 



Db 3520 AAAGATGCTATGGCTAAAATCCAAGCAAAA^ 357g 
Result          Query
   No.   Score  Match  Length  DB  ID        Description

     1    3833  100.0    3833   3  AAD01174  Aad01174 Bovine ne
     2  2687.8   70.1    4822   6  ABS70449  Abs70449 Human bon
     3  2649.8   69.1    4632   6  ABV94680  Abv94680 Human pan
     4  2566.4   67.0    4698   7  ABX34563  Abx34563 Human mdd
     5  2379.4   62.1    4053   4  AAS09453  Aas09453 Human cDN
     6  2379.4   62.1    4053   8  ACC81048  Acc81048 Human Nog
     7  2370.4   61.8    4093   3  AAA23454  Aaa23454 cDNA enco
     8  2223.6   58.0    3579   3  AAZ56886  Aaz56886 Human MAG
     9  2223.6   58.0    3579   4  AAF90324  Aaf90324 Human NOG
    10  2223.6   58.0    3579   6  ABK90134  Abk90134 DNA encod
    11  2223.6   58.0    3579   6  ABN86601  Abn86601 Human neu
    12  2183.4   57.0    4684   3  AAD01173  Aad01173 Rat neuri
    13  2179.4   56.9    4684   6  ABN86600  Abn86600 Rat neuro
    14  1777.2   46.4    2386   2  AAV30920  Aav30920 Human sec
    15    1774   46.3    2386   5  AAF98399  Aaf98399 Human cDN
    16  1414.6   36.9    1980   4  AAI98079  Aai98079 Human neu
    17  1039.8   27.1    1758   4  AAF32725  Aaf32725 Human sec
    18  1031.8   26.9    2240   3  AAC64406  Aac64406 Human Nog
    19  1025.4   26.8    2235   6  ABV94681  Abv94681 Human pan
    20   989.2   25.8    1514   6  ABK34580  Abk34580 Human cDN
    21     986   25      1798   6  ABK90135  Abk90135 DNA encod
    22   827.2   21      2052   6  ABK90133  Abk90133 DNA encod
    23   826.4   21      2782   9  ADB85284  Adb85284 Rat fooce
    24   750.4   19      1568   3  AAD01175  Aad01175 Rat neuri
    25   685.8   17      1213   2  AAX04379  Aax04379 Human sec
    26   685.8   17      1610   3  AAZ36230  Aaz36230 cDNA enco
    27   681.4   17.8     991   2  AAX97587  Aax97587 Extended
    28   681.2   17.8    1694   4  AAK94408  Aak94408 Human ful
    29   590.8   15.4     799   2  AAV23695  Aav23695 Human NSP
    30   573.4   15.0     770   3  AAA72983  Aaa72983 Human NSP
    31   573.4   15.0    1216   6  ABA05903  Aba05903 Human RTN
    32   552.4   14.4    2100   7  ABT42936  Abt42936 Human neu
    33     539   14.1    1683   4  AAD08386  Aad08386 Human sec
    34   538.8   14.1     868   3  AAZ56887  Aaz56887 Human MAG
    35     535   14.0    4710   4  AAL04697  Aal04697 Human rep
    36     535   14.0    4710   4  ABL97604  Abl97604 Human tes
    37     527   13.7     600   4  AAF90323  Aaf90323 Human NOG
    38     527   13.7    1122   3  AAZ56888  Aaz56888 Human MAG
    39     527   13.7    1122   4  AAF90325  Aaf90325 Human NOG
    40     464   12.1     472   7  ABX50578  Abx50578 Bovine ES
    41   463.4   12.1    3535   4  AAH72837  Aah72837 Human cer
    42   427.2   11.1     447   7  ABX50261  Abx50261 Bovine ES
    43     427   11.1     441   7  ABX53872  Abx53872 Bovine ES
    44   423.8   11.1     429   7  ABX53062  Abx53062 Bovine ES
    45     422   11.0     422   7  ABX43312  Abx43312 Bovine ES


ALIGNMENTS 



RESULT 1 
AAD01174 

ID AAD01174 standard; cDNA; 3833 BP.
XX 

AC AAD01174; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Bovine neurite growth inhibitor Nogo cDNA. 
XX 

KW neurite growth inhibitor; Nogo; neural cell; myelin; CNS;
KW central nervous system; neoplastic disease; antiproliferative; glioma;
KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma;
KW degenerate nerve disease; Alzheimer's disease; Parkinson's disease;
KW hyperproliferative disorder; benign dysproliferative disorder; diagnosis;
KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment;
KW structural plasticity; screening; ss.
XX 

OS Bos sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160.
XX 

PR 06-NOV-1998; 98US-0107446P.
XX 

PA (SCHW/) SCHWAB M E. 
PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 
PT Nogo proteins and nucleic acids useful for treating neoplastic disorders
PT of the central nervous system and inducing regeneration of neurons.
XX
PS Claim 26; Fig 12; 122pp; English.
XX
CC The present sequence is a cDNA encoding bovine Nogo protein which is a
CC potent neural cell growth inhibitor and is free of all central nervous
CC system (CNS) myelin material with which it is natively associated. The
CC present sequence was obtained from bovine spinal cord white matter cDNA
CC library. Nogo proteins and fragments displaying neurite growth inhibitory
CC activity are used in the treatment of neoplastic disease of the CNS e.g.
CC glioma, glioblastoma, medulloblastoma, craniopharyngioma, ependymoma,
CC pinealoma, haemangioblastoma, acoustic neuroma, oligodendroglioma,
CC menagioma, neuroblastoma or retinoblastoma and degenerative nerve
CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which
CC promote Nogo activity can be used to treat or prevent hyperproliferative
CC or benign dysproliferative disorders e.g. psoriasis and tissue
CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to
CC inhibit production of Nogo protein to induce regeneration of neurons or
CC to promote structural plasticity of the CNS in disorders where neurite
CC growth, regeneration or maintenance are deficient or desired. The animal
CC models can be used in diagnostic and screening methods for predisposition
CC to disorders and to screen for or test molecules which can treat or
CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are
CC referred to in claim 32 and SEQ ID NO: 29 in disclosure of the
CC specification. However the specification does not include sequences for
CC these SEQ ID numbers
XX
SQ Sequence 3833 BP; 1235 A; 717 C; 818 G; 1063 T; 0 U; 0 Other;

Query Match 100.0%; Score 3833; DB 3; Length 3833;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 3833; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
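
The base counts reported for this record can be cross-checked against its stated length. The sketch below is illustrative only: `base_composition` is a hypothetical helper, not part of the search software, and it is applied here to just the first 60 bases of the query as they appear in the alignment that follows.

```python
from collections import Counter


def base_composition(seq):
    """Count A, C, G, T, U and any other symbols in a nucleotide string."""
    counts = Counter(seq.upper())
    result = {base: counts.get(base, 0) for base in "ACGTU"}
    result["Other"] = sum(n for b, n in counts.items() if b not in "ACGTU")
    return result


# Counts reported for AAD01174 (3833 BP).
reported = {"A": 1235, "C": 717, "G": 818, "T": 1063, "U": 0, "Other": 0}
total = sum(reported.values())
print(total)  # 3833

# First alignment row of the query (positions 1-60).
first_row = "CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA"
row_counts = base_composition(first_row)
print(sum(row_counts.values()) == len(first_row))  # True
```

The reported counts sum to 3833, which agrees with the stated sequence length.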

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCAT^ 120 

61 ^GCA^GA^^ "° 
121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

;;: "° 

m ^GGA^icA^ - 
24! aatcctagggacgaaatagttgtgaggagtagaga™ga^ 300 

241 -^GA^iiGi^GAGlA ^ii^ii^i^ 300 

301 «CATCGTTGATACTCAGCAGGAGTTATCTACA^ 360 

301 i^ciicAiAC^AGG^ ™ 
361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

h^^^ «• 

421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGC^ 480 

!— «■ 

481 AAAGATACTTAGAAGC^GATAGTGATGTTTTGATTGCTGGA^ 540 

541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACA^CAAATCGTGAA 600 
51 1 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGA^ 

601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGT 660 

601 AAAGATAGTGAAAGCAGTAATGATGACACT «» 
661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACT^GAATGTTT 720 

721 ACAAACATTTT T CCCTT GTT GGAAGAT CATACTT CGGAAAAT AAGAC AGAT GAAAAAAAG 780 

2 Er=^^ - 

781 MAGAAAAAAAAAGGCAGAAATTGTAACAGA^ •«<> 

78 1 ^gIIa^ "° 

841 CTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCAA 900 



841 CTTTC 



Qy 901 AGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAGG 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAGG 960

Qy 961 AAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAAA 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 AAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAAA 1020

Qy 1021 TGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTTT 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTTT 1080

Qy 1081 GCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATGG 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 GCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATGG 1140

Qy 1141 AAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCAT 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 AAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCAT 1200

Qy 1201 CACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAATC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 CACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAATC 1260

Qy 1261 CCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAAG 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAAG 1320

Qy 1321 AAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATAT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 AAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATAT 1380

Qy 1381 CTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCT 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCT 1440

Qy 1441 CTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTTG 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 CTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTTG 1500

Qy 1501 AAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAAG 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 AAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAAG 1560

Qy 1561 TTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCAT 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 TTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCAT 1620

Qy 1621 CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT C AGT GCTT CACCAT CACCT GAGG 1680 

I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1621 CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCTT CAC CAT CACCT GAGG 168 0 

Qy 1681 GAGGAAAAC C GT AT T T GG AGT CT T T T CAG C C CAGT T T AGG CAT C AC AAAAGAT AC CT T AG 1740 

II I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 1681 GAG GAAAAC C GT AT T T GG AGT CT T T T CAG C C CAGT T T AGG CAT C ACAAAAGAT AC C T TAG 1740 



Qy 1741 C AC CT GAT GAAGTT T C AGC AT T GAC C CAAAAG GAGAAAAT C C CT T T G C AGAT G GAG GAGC 18 0 0 

I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I II M I I 
Db 1741 CAC CT GAT GAAGT T T C AGC AT T GAC CC AAAAG GAGAAAAT C C C T T T G C AGAT GGAGGAG C 1800 

Qy 1801 T CAAT AC T G C AGT T TAT T C AAGT GAT G G CT T AT T CAT T GC T C AG GAAGCAAAC C TAAGAG 1860 

I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I II I 1 I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 T CAAT AC T G C AGT T TAT T C AAGT GAT G G C T TAT T CAT T G CT C AG GAAG C AAAC C TAAGAG 18 60 

Qy 1861 AAAGT GAAAC AT T T T C AGAT T CAT CT C C GAT T GAGAT TAT AGAT GAGT T C C C GAC CT T T G 1920 

I I I 1 I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 AAAGT GAAAC AT T T T C AGAT T CAT CT C C GAT T GAGAT TAT AGAT GAGT T C C CGAC CT T T G 1920 

Qy 1921 T C AGT T CT AAAGC AGAT T CT T C T C CT AC AT TAG C CAGGGAAT ACAC T GAC CT AGAAGT AG 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 T CAGTT CT AAAG C AGAT T CT T C T C CT AC AT TAG C CAGGGAAT ACAC T GAC C T AGAAGT AG 198 0 

Qy 1981 C C CACAAAAGT GAAAT T GCT GAC AT C C AGGAT GGAGCT GG GT CAT T G G CT T GT G C AGGAT 2040 

I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I 
Db 1981 C C CACAAAAGT GAAAT T GC T GAC AT C C AGGAT GGAGCT GGGT CAT TGGCTTGTG C AGGAT 2040 

Qy 2041 T GC C CCAT GAC C T T T C T TT CAAGAGT AT ACAAC C TAAAGAGGAAGT T CAT GTC C CAGAT G 2100 

I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I II I I I I I I I I I II I I I I II I I I I 
Db 2041 T GC C CCAT GAC CT T T CT TT CAAGAGT AT ACAAC CTAAAGAGGAAGT T CAT GT C C CAGAT G 2100 

Qy 2101 AGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCTG 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2101 AGT T CT C CAAAGAT AGGGGT GAT GT T T C AAAGGT GC C C GT ACT GC C T C CAGAT GT T T C T G 2160 

Qy 2161 C T T T GGAT GCT CAAGC AGAGAT AG GC AGC AT AGAAAAAC C CAAAGT T C T T GT GAAAGAAG 222 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 2161 CT T T GGAT GCT CAAGC AGAGAT AG GC AGC AT AGAAAAAC C CAAAGT T C T T GT GAAAGAAG 2220 

Qy 2221 C C GAGAGAAAACT T C CT T CT GAT ACAGAAAAAGAGC GAAGAT CT C CAT CT GCT AT ATT T T 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I II I I I I I I I I I I I I I M I I I I I 
Db 2221 C C GAGAGAAAACT T C CT T CT GAT ACAGAAAAAGAGC GAAGAT CT C C AT CT GCT AT ATT T T 22 80 

Qy 22 81 CAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGA 2 34 0 

I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2281 CAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGA 234 0 

Qy 2341 CTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTG 2400 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I M II 
Db 2341 CTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTG 2400 

Qy 24 01 TGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATAT 2460 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 4 01 TGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATAT 24 60 

Qy 2461 AT AAG GGT GT GAT C C AG GCT AT C CAGAAAT CT GAT GAAG GC CAC C CAT T C AGG G CAT AT T 2520 

I I I I I I I I I I II I I I I I I I I I II I I I II II I I I I I I I I II II II I I I I I I I I I I I I I I I I 
Db 2461 AT AAG GGT GT GAT C C AGGC TAT C CAGAAAT CT GAT GAAG G C CAC C CAT T C AGG G CAT AT T 2520 

Qy 2521 TGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTG 258 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I II I I I II I I I I I I I I I I I I I 
Db 2521 TGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTG 2580 



Qy 



2581 GT CAT GT T AAC T GC AC AATAAAAGAACT C AGAC GC C T CT T CT T AGT T GAT GAT T T AGT T G 2 64 0 



Db 2581 GT CAT GT TAAC T GC ACAAT AAAAGAACT C AGAC GCCTCTTCT T AGT T GAT GAT T T AGT T G 2 640 

Qy 2641 ATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATG 2700 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I M I I I M I I I I 
Db 2 641 ATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATG 2700 

Qy 2701 GTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAAC 2760 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2701 GTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAAC 2760 

Qy 27 61 G G CAT C AGG C G CAAAT AGAT CAT TAT C T GGGAC TT GCAAAT AAGAAT GT TAAAGAT GCT A 2 82 0 

I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 2761 GGC AT C AGG C GCAAAT AGAT CAT T AT CT G G GACTT G CAAAT AAGAAT GT TAAAGAT GCT A 2820 

Qy 2821 T G GCT AAAAT C CAAGC AAAAAT C C C T GGAT T GAAG C GTAAAG C T GAAT GAGAAAGC CT GA 2 8 80 

I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I 

Db 2 821 T GGC T AAAAT C CAAG CAAAAAT C C CT GGAT T GAAGC GT AAAGCT GAAT GAGAAAG C CT GA 28 80 

Qy 2881 AAGAGTTAACAATAGAGGAGTTTAT CTTTAAAGGGGATATT CATTT GAT T C CATT GGGGA 2 94 0 

II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I II II I I I I I I I I II I I I I I I I I I I 

Db 28 81 AAGAGT T AACAAT AGAGGAGT T TAT CT T TAAAG GGGAT AT T CATT T GAT T C CAT T GGGGA 2 94 0 

Qy 2941 GGGT CAGGGAAGAACAAAGC CT T GAC AT T G C AGT GCAGT T T CACAGAT C T T TAT T T TT AG 3000 

II I I I I I I I I I I I I I I I I II I I I I I I M I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 941 GG GT CAGG GAAGAACAAAGC C T T GAC AT T GC AGT G CAGT TT CACAGAT CT T TAT T T TT AG 3000 

Qy 3001 CAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGT 3060 

I I II I I I I I I I I I I I I I I I I I I I I II Ml I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 3001 CAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGT 3060 

Qy 3061 AT T GT AAG CT GC T AT GT AT GGAT T T AAAT C GT AAT CAT AT T T GTT T T T C CT GT AT GAGGC 3120 

II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I 

Db 3061 AT T GT AAGCT GCT AT GT AT GGAT T T AAAT C GT AAT CAT AT TTGTTTTTCCT GT AT GAGGC 3120 

Qy 3121 ACT GGT GAAT AAACAAAGAT CT GAGAAAGCT GT ATAT T AC ACT T T GT C GC AG GT AGTCT T 3180 

I I I I I I M I I I I I I I I I I I I II I I I I I 1 I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 3121 ACT GGT GAAT AAACAAAGAT C T GAGAAAG CT GT AT AT T AC ACT T T GT C GC AGGT AGT CT T 3180 

Qy 3181 GCT GT AT T T G G G GAAT T GCAAAGAAAGT GGAGCT GACAGAAATAAC C CTT T T C AC AGT T T 324 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 3181 GCT GT AT T T GGGGAAT T GCAAAGAAAGT G GAG C T GACAGAAATAAC C C TT T T C AC AGT T T 324 0 

Qy 3241 GTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACG 3300 

I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3241 GT G C AC T GT GT AC GGT CT GT GT AG GT T G AT GCAGATT T T CT GAAAT GAAAT GTT TAG AC G 3300 

Qy 3301 AGAT CAT GC C AC CAAG GCAGGAGT GAAAAAGC TTGCCTTTCCT GGT AT GT T C TAG GT GT A 3360 

I I II I I M I I I I I I I I I I I I I II II I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I II 
Db 3301 AGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTA 3360 

Qy 3361 TT GT GAAAT TTACT GTT GTAT TAATT GCCAATATAAGTAAATATAGATT AT AT ATATCTA 3420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 3361 TT GT GAAAT TTACT GTT GTAT TAATT GC CAATATAAGTAAATATAGATTATAT AT ATCT A 342 0 

Qy 3421 TATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATA 34 8 0 

I I I II II I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 3421 T AT AT AGT GT T T C AC GAAGC T T AGC C C T T T AC C T T C C C AG CT G C C C C AC AGT G C T T GAT A 3480 

Qy 3481 CTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACT 3540

1 I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I M I I I | | I | | I I 
Db 3481 CTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACT 3540

Qy 3541 TCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCA 3600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I II 
Db 3541 T CT AGG C G CAC T AC CAT C T GT T T T CAAC AC GAAC C GAC G C CAT GCAAAC AGAACT C C T C A 3600 

Qy 3601 AC AT AAAC T T C ACT G CAC AGACT TACT GT AGT T AATT T TAT C ACAAACT CT GGACT GAAT 3660 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I II I I I I I I I M I I I 
Db 3601 AC ATAAACT T CAC T GC AC AGAC T T ACT GTAGT TAATT T TAT C ACAAAC T C T GGAC T GAAT 3660 

Qy 3661 CTAATGCTTCCAAAAATGTTTGCAAATATCAAACATTGTTATGTAAGAAAATATAAATGA 3720

I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I | I | | | || | | 
Db 3661 CTAATGCTTCCAAAAATGTTTGCAAATATCAAACATTGTTATGTAAGAAAATATAAATGA 3720

Qy 3721 CGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAAC 3780

I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I II I I I I I II I I 
Db 3721 C GAT T TAT ACAAT T GT G GT T TAAGCT GT AT T GAACT AAAT CT GT GGAAT GCAT T GT GAAC 3780 

Qy 3781 TGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAAAAAAAAAAAAAA 3833

I II II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 3781 TGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAAAAAAAAAAAAAA 3833
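
Note: each Qy/Db row above carries a start coordinate, a run of residues, and an end coordinate, so the number of residues (ignoring "-" gap characters) should equal end - start + 1. The following is a minimal sketch of that check, assuming whitespace-separated fields and intact coordinates; the function name and the sample rows are hypothetical and are not part of the search program's output.

```python
def row_is_consistent(row: str) -> bool:
    """Return True when residue count matches the row's coordinates.

    Assumes the layout 'LABEL START RESIDUES... END', where stray spaces
    inside the residues are extraction noise and '-' marks a gap.
    """
    fields = row.split()
    start, end = int(fields[1]), int(fields[-1])
    # Join the residue fragments and drop gap characters before counting.
    residues = "".join(fields[2:-1]).replace("-", "")
    return len(residues) == end - start + 1


# Hypothetical short rows, not taken from the report.
print(row_is_consistent("Qy 1 ACGT--AC 6"))   # 6 residues, span 1..6
print(row_is_consistent("Db 10 ACG TAC 14"))  # 6 residues, span 10..14
```

A row whose coordinate was split during extraction (for example an end value printed as two fragments) is reported as inconsistent by this sketch, which is a convenient way to locate damaged rows.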



RESULT 2 
ABS70449 

ID ABS70449 standard; cDNA; 4822 BP. 
XX 

AC ABS70449; 
XX 

DT 27-NOV-2002 (first entry) 
XX 

DE Human bone remodelling gene #106. 
XX 

KW Bone remodelling; osteoporosis; human; gene; ss. 
XX 

OS Homo sapiens.
XX 

PN US6426186-B1. 
XX 

PD 30-JUL-2002. 
XX 

PF 18-JAN-2000; 2000US-00484970.
XX 

PR 18-JAN-2000; 2000US-00484970.
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Jones KA, Volkmuth W, Walker MG; 
XX 

DR WPI; 2002-673014/72. 
XX 

PT A combination of polynucleotides which are co-expressed with genes known 

PT to be involved in bone remodeling and osteoporosis are useful in an array 



PT for the diagnosis of bone remodeling and osteoporosis associated 

PT disorders. 

XX 

PS Claim 1; Col 283-288; 206pp; English. 
XX 

CC The invention relates to a combination comprising a number of 

CC substantially purified and isolated polynucleotides which are co- 

CC expressed with genes known to be involved in bone remodelling and 

CC osteoporosis. The invention is used to diagnose disorders associated with 

CC bone remodelling or osteoporosis. ABS70344-ABS70512 represent human bone

CC remodelling genes of the invention 

XX 

SQ Sequence 4822 BP; 1441 A; 1046 C; 1073 G; 1247 T; 0 U; 15 Other; 
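
Note: the SQ line states the record length followed by per-base counts, so the counts should sum to the length. The following is a minimal sketch of that check, assuming the "N NAME;" layout seen in the SQ line above; the function name is hypothetical.

```python
import re


def parse_sq_line(line: str) -> tuple[int, dict[str, int]]:
    """Split an SQ line into (declared length, per-symbol counts)."""
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    # Every 'number name;' pair except the leading 'BP' length field.
    counts = {
        name: int(number)
        for number, name in re.findall(r"(\d+)\s+([A-Za-z]+);", line)
        if name != "BP"
    }
    return length, counts


sq = "SQ Sequence 4822 BP; 1441 A; 1046 C; 1073 G; 1247 T; 0 U; 15 Other;"
length, counts = parse_sq_line(sq)
print(length, sum(counts.values()))
```

For this record the counts add up to the declared 4822 bases, with 15 positions reported as "Other" (ambiguity codes such as N).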

Query Match 70.1%; Score 2687.8; DB 6; Length 4822; 

Best Local Similarity 85.6%; Pred. No. 0; 

Matches 3333; Conservative 0; Mismatches 458; Indels 104; Gaps 27;
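
Note: the summary figures above are mutually consistent under two assumed definitions, neither of which is stated in the report itself: Best Local Similarity as matches divided by all aligned columns (matches + mismatches + indels), and Query Match as the score divided by the score of a perfect self-match, taken here as the query length of 3833 (the last query coordinate in the alignments, assuming one point per matched base). A minimal sketch with hypothetical function names:

```python
def local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are exact matches (assumed definition)."""
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


def query_match(score: float, perfect_score: float) -> float:
    """Score as a percentage of a perfect self-match score (assumed definition)."""
    return 100.0 * score / perfect_score


# Values taken from the RESULT 2 summary lines.
similarity = local_similarity(matches=3333, mismatches=458, indels=104)
match_pct = query_match(score=2687.8, perfect_score=3833)
print(f"{similarity:.1f}% {match_pct:.1f}%")  # 85.6% 70.1%
```

Both values round to the percentages printed in the summary, which supports (but does not prove) the assumed definitions.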

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 964 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 1023 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

III I I I I I I I I I I I II II I I I I I II II I I I M I I I I I I I I II I I I I I I 

Db 102 4 GT AT T AC C CAC T GAAGGAAC AC T T CAAGAAAAT GT CAGT GAAG CT T CT AAAGAGGT CT C A 1083 

Qy 121 G AGAAG G C AAAAAAT C CAT T T GT AG AGAGAAAT T T AAC AGAAT T T T C AGAAT T G G AAT AT 18 0 

I I I I I I I I I I I I I I I I I I I I I III I I I I II I M I I I I I I I I I I I I I I I I I 
Db 1084 GAGAAGGCAAAAACT CT ACT CAT AGAT AGAG AT T T AAC AGAGT T T T C AGAAT T AGAAT AC 1143 

Qy 181 T C AGAAAT GGAAT CAT CAT T CAGT GG CT CT CAAAAG GC AGAAC CT GC C GT AAC AGTAGC G 24 0 

I I I I I I I I I I I I I I I I I I II I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I 
Db 1144 T C AGAAAT G G GAT CAT C GT T CAGT GT CT C T CCAAAAGC AGAAT CT G C C GTAAT AGTAGC A 1203 

Qy 241 AAT C CT AGGGAC GAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAGGACT TAGT TAGT CTT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1204 AAT C CTAGGGAAGAAAT AAT C GT GAAAAATAAAGAT GAAGAAGAGAAGTT AGT TAGT AAT 1263 

Qy 301 AACAT C C T T CAT AC T C AGC AGGAGT T AT CT AC AGT CC T T AC GAAAT CAGT T GAAGAAGAA 360 

I I I I I I I I I I I I I 111 II I I I I I I I I I I I I I I I I I I I I I III I I II II 

Db 1264 AACAT C C T T CAT AAT CAACAAGAGT T AC CT AC AG C T CT T AC T AAAT T G GT T AAAG AGGAT 1323 

Qy 361 GAT AGAGT T CT GT C T C C AGAAAAAACAAAG GAC AGT T T T AAGGAAAAGGGAGT T GC AGC A 42 0 

II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I II I I I I M I 

Db 1324 GA AGT T GT GT C T T C AGAAAAAG CAAAAGACAGTT T T AAT GAAAAG AGAGT T GCAGT G 1380 

Qy 421 GAAG CT T C TAT GG GGGAGGAAT AT G C AGACT T CAAAC CAT T T GAG C GAGT AT G G GAAGT G 48 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1381 GAAGCT C C TAT GAGG GAGGAAT AT GC AGAC T T CAAAC C AT TT GAG C GAGT AT GGGAAGT G 14 4 0 

Qy 4 81 AAAGAT ACT T AC AAG CAAGAT AGT GAT GT T T T GAT T GCT GG AGGT AAT AT AGAGAG CAAA 54 0 

I I I I I I I I III I I I I I II I I I I I III I I I I I I I I I I I I II I I I I I I I I 
Db 1441 AAAGATAGT AAG GAAGAT AGT GAT AT GTTGGCTGCTG GAG GT AAAAT C GAGAG C AAC 14 97 



Qy 541 [sequence not recovered] 600

Db 1498 [sequence not recovered] 1557


Qy 601 AAAGAT AGT GAAAGCAGT AAT GAT GAC AC T T CAT T T C C C AGTACAC C AGAAG C T GT AAGA 660 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I II I I I I I I I I I II I I III 
Db 1558 AAAGAT AGT GAGAGT AGTAAT GAT GAT AC TTCTTTCCC C AGT AC G C C AGAAG GT AT AAAG 1617 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACT GAGAAT GT T 717 

I I I I I I I I II I II I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I II 

Db 1618 GAT C GT T CAGGAGC AT AT AT C AC AT GT GCTCCCTT T AAC C CAG C AG CAACT GAGAGC AT T 1677 

Qy 718 T CAACAAAC AT TTTTCCCTTGTT GGAAGAT CAT AC TT C GGAAAATAAGAC AG AT GAAAAA 777 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 167 8 GCAAC AAACAT TTTTCCTTTGT TAG GAGAT C C T ACTT C AGAAAAT AAGAC C GAT GAAAAA 1737 

Qy 778 AAGA--TAGAAAAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATC 835

II I I I I II I I I I I I I I II I I I I I I I I II I I I I I I II III I I I I I 

Db 1738 AAAAAT AGAAGAAAAGAAG GC C CAAAT AGT AAC AGAGAAGAAT ACT AGC AC CAAAAC AT C 17 97 

Qy 836 AAAC C C- T TT CCTTAT GGCAGCACAGGAGT CTAAGACAGAT TAC GTT ACAACAGAT CAT G 8 94 

I I I I I I III I II I I I I I I I I I I I I III I I I I I I I I I I II I I I I I I I I I II 
Db 1798 AAAC CCTTTTACTT GTAGCAGCACAGGATT CT GAGACAGATTAT GT CACAACAGATAAT T 1857 

Qy 8 95 TGTCAAAGGT GAC CGAGGAAGT AGT GGC AAACAT GCCTGAAGGTCTAACCCCAGATTTGG 954 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I II I I II II I II I M I I I 
Db 1858 TAACAAAGGT GACT GAGGAAGT C GT GG C AAAC AT G C CT GAAGGC CT GAC T C C AGAT T TAG 1917 

Qy 955 TT C AG GAAGC AT GT GAAAGT GAATT GAAT GAAG C T ACT G GT ACAAAAAT T GC CT T T GAAA 1014 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 1918 T ACAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGT TACT GGT ACAAAGAT T G CTT AT GAAA 1977 

Qy 1015 CAAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACAC 1074

I I II I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I II MM I III 
Db 1978 C AAAAAT GGACTT GGTT CAAACAT CAGAAGTTAT GCAAGAGT CACT CTAT C CT GCAGCAC 2037 

Qy 1075 AGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTG 1134 

I I I II I II I I I II II I I I I I I II M II II II I I I I I I I I I I II I I II M I II I II 
Db 2038 AGCT T T G C C CAT CAT T T GAAGAGT C AGAAGCT ACT C CTT C AC C AGT T T T G C CT GACATT G 2 097 

Qy 1135 TCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCA 1194 

I I I I I I I I I I I I I II I II I I II I I II I I I M I I I I II I I I I II I I II II II 
Db 2098 TTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCA 2157 

Qy 1195 GT T CAT C AC CAT T AGAAACT CT T C C T T C AGT TAAT TAT GAAAGC AT AAAGT T T GAG C CT G 1254 

I I I I I I I I II I I I II I I I I I I I I I I II II I I I II I II I II I I I I M I I I I I 

Db 2158 GCT C AT C AC CAT TAGAAGC TT C T T C AGT TAAT TAT GAAAGC AT AAAAC AT GAG C CT G 2214 

Qy 12 55 AAAAT C C C C C AC CAT AT GAGGAGG C CAT GAAT GT AT CACT AAAAAAAGAAT CAG GAAT GA 1314 

II I I I II I I II I I II I I I I I I I I I I I I I M II I I II I I I I I II I I I I II II I I I I 

Db 2215 AAAAC C C C C C AC CAT AT GAAGAGGC CAT G AGT GT AT C AC T AAAAAAAGT AT C AGG AAT AA 2274 

Qy 1315 AT GAAGAAAT CAC AGAG C CT GAAGGT AT T AGT GT AG CT GTT CAG GAAAC AGAAG CT C C T T 1374 

I I I M I II I I I I I I II I I II Mill II I I I I MM M II II I M II M I I I 
Db 22 75 AGGAAGAAAT T AAAGAG C C T GAAAAT AT TAAT GCAGC T CTT CAAGAAAC AGAAGC T C CT T 2334 

Qy 1375 AT AT AT C TAT T G CAT GT GAT T TAAT TAAAGAAAC AAAGAT C T C TAC T G AAC C GAC T C CAG 1434 

I M M II II II I I II II II I I I M || I || I I I II I I II I II I I I II I I I II I I I 
Db 2335 ATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGG 2394 



Qy 1435 AT T T CT CT AGT T AT T CAGAAAT AGC AGAAGT T GC ACAG C CAGT G C C C GAG CAT T C T GAG C 1494 

MINIM I I I I I I I I I I I I III I I I I I I I I I I M I I I I I I II I I I I I I I I I I 
Db 2395 AT T T C T C T GAT TAT T CAGAAAT GGCAAAAGT T GAAC AGC CAGT G C CT GAT CAT T C T GAG C 2454 

Qy 1495 T AGT T GAAGAT TCCTCCCCC GAT T C T GAAC CAGT T GAC T TAT T T AGT GAT GAT T CAAT AC 1554 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I | | | | I I | | M | | M M | | | | 

Db 2 455 T AGT T GAAGAT T C CT CAC CT GAT T CT GAAC CAGT T GAC T TAT T T AGT GAT GAT T CAAT AC 2514 

Qy 1555 C C GAAGT T C C ACAAAAACAAGAT GAAG C T GT AAT ACT T GT GAAAGAAAAC CT C ACT GAAA 1614 

I M I I I I I II I I I I MINI II I I I I I I I I I I I I I I I I I I I I I I 

Db 2515 CT GACGTT C CACAAAAACAAGAT GAAACT GT GAT GCTT GT GAAAGAAAGT CT CACT GAGA 2574 

Qy 1615 TTT CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCT T CAC CAT CAC 1674 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I | Ml 
Db 2575 CTT CATT T GAGT CAAT GATAGAATAT GAAAATAAGGAAAAACT CAGT GCTTTGC CAC 2631 

Qy 1675 CT GAGG GAG GAAAAC C GT AT T T GGAGT CT TTT C AGC C CAGT T T AGG C AT C ACAAAAGAT A 1734 

I I I I I I M I I I I I II I I I I I I I I I I I I I I III I I I I I I I I I I M I I I I I I I I 
Db 2 632 CT GAGGGAGGAAAGC CAT AT T T G GAAT CT T T T AAGC T C AGT T TAGATAAC ACAAAAGAT A 2 691 

Qy 1735 CCT T AG CAC CT GAT GAAGT T T C AGC AT T GAC C CAAAAG GAGAAAAT C C CT T T GC AGAT G G 17 94 

M I I I I I I I I I I I I I I I II I! I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 2 692 C C C T GT T AC C T GAT GAAGT T T CAAC ATT GAGC AAAAAGGAGAAAATT C CT T T GC AGAT GG 2751 

Qy 1795 AGGAGCTCAATACTGCAGTT TAT TCAAGT GAT GGCT TAT T CATT GCT CAGGAAGCAAACC 1854 

I I I I l I I I I I I I I I I I I I I I I I I I I I I I I I I MINI III II I I I I M I I I 
Db 2 752 AGGAGC T CAGT ACT GCAGT T TAT T C AAAT GAT GACT T AT T TAT T T CTAAG GAAGCACAGA 2811 

Qy 1855 TAAGAGAAAGT GAAACAT TTT C AGAT T CAT C T C C GAT T GAGAT TAT AGAT GAGT T C C C G A 1914 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I 
Db 2 812 TAAGAGAAAC T GAAAC GT T T T CAGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT TC C CT A 2 871 

Qy 1915 C C TT T GT CAGT T CTAAAGC AGAT T CT T CT C CT AC AT T AGC CAG G GAAT AC ACT GAC CT AG 1974 

I M I I I I I I I M M I I I I I I I I III I I II I I I I I I I I I I I I I I I I I II I I 
Db 2872 CATT GAT CAGT T CT AAAACT GAT T CAT T T T CT AAAT T AGC C AGGGAAT AT ACT GAC CT AG 2931 

Qy 1975 AAGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTG 2034 

I I I I I I M I I M II II I I II I I II I I II I I I I I I I I I M I I I I II M I I I I 
Db 2932 AAGT AT C C C ACAAAAGT GAAAT T GCT AAT GC C C C GGAT GGAGCT G G GT CAT T GC CT T GC A 2991 

Qy 2035 C AGGAT T GC C C CAT GAC CT T T CTT T CAAG AGT ATACAAC C TAAAGAG GAAGT TC 2 088 

I M I I II I I I I I I I I I I I II I I II I I I I I I I I I I I I I Mill II I 

Db 2992 C AGAAT T GC C C CAT GAC CTTTCTTT GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT C A 3051 

Qy 2 08 9 AT GT CC CAGAT GAGTT CT C CAAAGATAGGGGT GAT GTT T CAAAGGT GCCC GTACT GCCT C 2148 

I I I M II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I 
Db 3052 GTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTC 3111 

Qy 214 9 CAGAT GTTTCTGCTTT GGAT GCT CAAGC AGAGAT AG G CAGCAT AGAAAAAC C CAAAGT T C 2208 

I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I || I I I | | | I I II I I I 
Db 3112 CAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTC 3171 

Qy 2209 TT GT GAAAGAAGCCGAGAGAAAACTT CCTT CT GAT ACAGAAAAAGAGC GAAGAT CT CCAT 2268 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | I I II I I 
Db 3172 T T GT GAAAGAAG C T GAGAAAAAAC TTCCTTCC GAT ACAGAAAAAGAG GAC AGAT CAC CAT 3231 

Qy 22 69 CT G C TAT AT TTT CAG CAGAGC T GAG - T AAAACT T C AGT T GTT GAC C T C CT C TACT G GAGA 2327 



Db 3232 CTGCTATATTTTCAGCAGAGCTGAGCTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGA 3291 

Qy 232 8 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACA 238 7 

I I I I I I I II I I I I I I I I I I M I II M I I I I I I I I I 1 I II I I II I I I I I II I I I II 
Db 3292 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACA 3351 

Qy 238 8 GTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATC 2447 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III 
Db 3352 GT AT T CAGC AT T GT GAG C GT AACAG C C T AC AT TGCCTTGGCCCTGCTCTCTGT GAC CAT C 3411 

Qy 2448 AG CT T TAG GAT AT ATAAGGGT GT GAT C C AGG C TAT C C AGAAAT CT GAT GAAG G C C AC C C A 2507 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 3412 AGCT T TAG GAT AT ACAAG G GT GT GAT C C AAG C TAT C C AGAAAT C AGAT GAAGGC C ACC C A 3471 

Qy 2508 TT C AG GG CAT AT T T GGAAT C T GAAGT T G C TAT AT CT GAGGAGTT GGT T C AGAAGT AC AG C 2567 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3472 TT CAG GG C AT AT CT G GAAT C T GAAGT T GCT AT AT CT GAGGAGTT G GT T CAGAAGT AC AGT 3531 

Qy 2568 AATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTT 2627 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINIM I II II II I II II I I I 

Db 3532 AATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTT 3591 

Qy 262 8 GAT GAT T TAGT T GAT T CT CT GAAGT TTGCAGTGTT GAT GTGGGTATTTACCTATGTT GGT 2687 

II I II II II I II II I I II II II I II II II I I II M I II II I II I II II II II II II I I I I 

Db 3592 GAT GATT TAGT T GATT C T C T GAAGT T T G C AGT GT T GAT GT GGGT AT T T AC CT AT GT T GGT 3651 

Qy 2688 GCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCT 2747 

II I I II II II II I I I I II II II II I II I I II I I I I I I I II I II I II I II I I I II I I I 
Db 3652 GCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCT 3711 

Qy 274 8 GT TAT T TAT GAAC G G CAT CAG G C GCAAAT AGAT CAT TAT CT GGGAC T T GCAAAT AAGAAT 2807 

I II I I II II I II II II II II II I M II II I I I I I II M I I I II II II I I II I II M I 
Db 3712 GT TAT T TAT GAAC GGC AT C AGG CAC AGAT AGAT CAT TAT CT AGGACT T G CAAAT AAGAAT 3771 

Qy 28 08 GT TAAAGAT G CT AT GGCTAAAAT C CAAGC AAAAAT C C CT G GAT T GAAGC GT AAAGCT GAA 2 867 

I I II II II II I II II I I I I II II II M M M I II I I I II I I I I I II I I I I M II II I II 

Db 3772 GT TAAAGAT G C TAT GGCTAAAAT C CAAGCAAAAAT C C CT GGAT T GAAGC GCAAAG CT GAA 3 831 

Qy 2868 T GAGAAAG C C T GAAAGAGT T AACAAT AGAG GAGT T T AT CT T T AAAG G GGAT AT T CAT T T G 2 927 

II I II I II I I I I II I II I II II II II I II II II I II I I I II II I II 

Db 3832 T G AAAAC G C C C AAAAT AAT T A GT AGGAGT T C AT CT T T AAAGG G GAT AT T C ATT T G 388 6 

Qy 292 8 AT T C CAT T GGGGAGGGT CAGG GAAGAAC AAAG C CT T GAC AT T GCAGT G C AGT T T CAC 2 984 

III I I II II II II I I II I I II I M II I II II II I II I II I II I II I II II 

Db 3887 ATTATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGA 3945 

Qy 2985 AGAT CT T TAT T T T TAGCAAC GCAGT G - T C T GAGGAAAAAT GAC CT GT CT T GA 3035 

I I I II II I I II I II I II I III II I I I I II II I II I I I I I II M I I I 
Db 3946 TCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGA 4005 

Qy 3036 CTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAAT 3095 

I I II I I I II I I II II II I II I I I I II I I II I I II I I I I I I II I II I I II I II I II I II 

Db 4006 C T G C CAT GT GT T CAT C AT CT TAAGT AT T GT AAGC T G C TAT GT AT G GAT T TAAAC CGTAAT 4 065 

Qy 3096 CAT AT TTGTTTTTCCT GT AT GAGG CAC T G GT GAAT AAACAAAGAT CT GAGAAAGCT GT AT 3155 

II II I I I I I II II I II II I I II I I I I I I I I M I I II I I I 


Db 4066 CATA--TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTAT 4112



Qy 3156 ATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

III I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I 
Db 4113 ATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4172 

Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGT 3266 

I I I I I II I II I I I II I I I I I I I I I I I II II I I I I I I I I I 
Db 4173 AGAAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGAT 4232 

Qy 3267 T GAT G C AGAT T T T CT GAAAT GAAA T GTTTAGACGAGAT CAT GC CAC CAAGGCAGGA 3322 

I I I I I I II II II I I I I I I I I I I I I I I II I I I I I I I II I I I I || || I II I I I 

Db 4233 T GAT GC AGAT T T T CT GAAAT GAAAT GT T T GT T TAGAC GAGAT CAT AC C G GT AAAG CAG GA 4292 

Qy 3323 GTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATT 3382 

III I I I I I I I I MM I II I II I I I I I I I I I I I I I II I I II I I I I I II M II I I 
Db 42 93 ATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATT 4351 

Qy 3383 AATT GC CAAT ATAAGTAAAT AT AGAT TAT AT AT AT CT AT AT AT AGT GT T T CAC GAAGC T T 3442 

I I I I I I II I II I I I I I I I I I M I I I I II I I I I I I I I I I I I II II I I I I I M I 

Db 4352 AAT T GC CAAT ATAAGTAAAT AT AGAT TAT AT AT G TAT AG T GT T T CAC AAAG C T T 4405 

Qy 3443 AGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTT 34 96 

I I I I I I I I II II I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 44 06 AGAC C T T T AC C TT - C CAGC C AC C C CAC AGT GCT T GAT ATT T C AGAGT C AGT CAT T GGT T A 44 64 

Qy 34 97 TATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCA 3556 

II I I I I M II I I I I I I I I I II I I I II I I I I II I II I M I II I I I I M I 

Db 4465 TACAT GT GT AGTT CCAAAGCACATAAGCT AGAAGAAGAAAT ATTT CTAGGAGCACTAC CA 4524 

Qy 3557 T CT GT T T T CAACAC GAAC C GAC G C CAT GCAAACAGAACT C C - T CAAC AT AAACT T C ACT G 3615 

I I II I I I I II I I I II I I II I I II I I I I I I I II II I I I I M II II II 
Db 4525 TCTGTTTTCAACATGAA AT GC CAC AC AC AT AGAACT C CAAC AAC AT CAAT T T CAT T G 4581 

Qy 3616 CACAGACTTACTGTAGTTAATTTTATCAC--AAACTCTGGACTGAATCTAATGCTTCCAA 3673

I I I I I I I I I I I I I I I I II I II II I I I I II M II II I I I I I I I I II I I I I I I I I I 
Db 4582 CACAGACT GACTGT AGTT AAT TTT GT CACAGAAT CTAT GGACT GAAT CTAAT GCTT CCAA 4 641 

Qy 3674 AAA T GTTT GCAAATAT CAAAC ATT GT TAT GTAAGAAAATAT AAAT 3718 

Ml I I I I II I I I I II I I I I II I I I II I II I I I I I I I Ml I I I I 

Db 4 642 AAAT GT T GTTT GTTT GCAAATAT CAAACATT GTTAT GCAAGAAATT AT TAATT ACAAAAT 4 7 01 

Qy 3719 GAC GAT T T AT ACAAT T GT GGT T T AAGCT GT AT T GAACT AAAT C T GT G GAAT GC AT T GT GA 3778 

M II I I I I I I I I II I I I I II II I I I I I I I I I II I I II I I 1 I II I I I I II II I I I I I I 
Db 47 02 GAAGAT T TAT AC CAT T GT G GT T TAAG C T GT ACT GAAC T AAAT CT GT GGAAT G CAT T GT GA 4761 

Qy 3779 ACT GT AAAAG C AAAGT AT CAAT AAAGCT TAT AGAC T T AAAAAAAAAAAAAAAAAA 38 33 

I I II I I II I I I II I I I I II I I I I I I II I I I I II I I I I II I I I I II I 

Db 47 62 AC T GTAAAAGC AAAGT AT CAAT AAAG CT TAT AGAC T TAAAAN GNMANNAGN GAAA 4816 
XX

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 
CC residues of (a); (d) sequences that hybridize to (a), under moderately

CC stringent conditions; (e) sequences having at least 75% or 90% identity 

CC to (a); or (f) degenerate variants of (a). Polypeptides (ABP68596- 

CC ABP68637) encoded by (I) and oligonucleotide can be used to detect cancer 

CC in a patient and compositions comprising polypeptides, polynucleotides, 

CC antibodies, fusion proteins, T cell populations and antigen presenting 

CC cells expressing the polypeptide are useful in treating pancreatic cancer 

CC and stimulating an immune response. The polynucleotides can be used as 

CC probes or primers for nucleic acid hybridisation, in the design and 

CC preparation of ribozyme molecules for inhibiting expression of the tumour 

CC polypeptides and proteins in the tumour cells, in vaccines and for gene 

CC therapy. Note: The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 
Query Match 69.1%; Score 2649.8; DB 6; Length 4632;

Best Local Similarity 85.0%; Pred. No. 0; 

Matches 3292; Conservative 0; Mismatches 442; Indels 137; Gaps 24; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II II 
Db 853 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 912

Qy 61 GT AC T GC C CAC T GAAGGAAC AC T T C C AG CAACT T CAAAT GAAGCT T CT AAAG C ATT CT C A 120 

Ml I I I I I I I I I I M I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I 
Db 913 GT AT T AC C CAC T GAAGGAACACT T CAAGAAAAT GT CAGT GAAG C T T CT AAAGAGGT CT C A 972 

Qy 121 GAGAAGGCAAAAAAT CCATTT GT AGAGAGAAATTTAACAGAATTT T CAGAAT T GGAAT AT 180 

I I I I I I I I I I I I I II I I I I I I III I I I I I I II I I I I I II I I I I I I I I I II 
Db 97 3 GAGAAGGCAAAAACT CTACT CATAGATAGAGATTTAACAGAGT T TT C AGAATTAGAATAC 1032 

Qy 181 T CAGAAAT GGAAT CAT CAT T CAGT G GC T C T CAAAAGG C AGAAC C T GCC GTAACAGT AGC G 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I 11 I I I I I I I I I MINI 
Db 1033 T CAGAAAT G G GAT CAT C GT T CAGT GT CT CT C CAAAAGC AGAAT CT G C C GT AAT AGTAG C A 1092 

Qy 241 AAT C C T AGGGAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAG GACT T AGT T AGT CT T 300 

I I II I I I I I I I I I I I I I I I I II I || I I I | M I I I I I I I I I I I I II II I 
Db 1093 AAT CCTAGGGAAGAAATAAT C GT GAAAAATAAAGAT GAAGAAGAGAAGTTAGTT AGTAAT 1152 

Qy 301 AAC AT C C T T CAT ACT CAGC AGGAGT T AT C T AC AGT C C T T AC GAAAT C AGT T GAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I I I MINI I I I I I I I I I Ml I I II II 

Db 1153 AAC AT C CT T CAT AAT CAACAAGAGT T AC CT AC AGC T CT T AC TAAAT T GGT TAAAGAGGAT 1212 

Qy 361 GAT AGAGT T CT GT CT C C AGAAAAAACAAAG GAC AGT T T TAAGGAAAAG G GAGT T G CAG C A 42 0 

II I I I I I I I I I I I I I I I I I I I I I 1 I I I I I M I I I I I I I I I I I I I I I I I I 

Db 1213 GA AGT T GT GT C T T CAGAAAAAG CAAAAGAC AGT T T T AAT GAAAAGAGAGT T GCAGT G 1269 

Qy 421 GAAG CTT CTAT GGGGGAGGAAT AT G C AGAC T T CAAAC CAT T T GAG C GAGT AT G G GAAGT G 480 

1)1(1) I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I 

Db 12 7 0 GAAGCT C CT AT GAG GGAGGAAT AT GCAGACTT CAAAC C ATT T GAGC GAGT AT GGGAAGT G 1329 

Qy 481 AAAGAT ACT T ACAAGCAAGAT AGT GAT GT T T T GAT T GC T GGAG GT AAT AT AG AGAG CAAA 54 0 

I I I I I M I III I I I II I I I I I I I III I I I M I I I I I I I II I I I I I I I I 
Db 1330 AAAGATAGT AAGGAAGATAGT GAT AT GTTGGCT GCT GGAGGTAAAAT C GAGAGCAAC 1386 

Qy 541 T T GGAAGGT AAAGT G GAT AAGAAACAC T T T T CAGAT AG C CT T GAACAAAC AAAT C GT GAA 600 

I I I I I I I I I II I I I I I I I I III III I I I I I I I I I I I I I I I II I I I I I III 
Db 1387 T T GGAAAGTAAAGT GGATAAAAAAT GT T T T GC AGAT AGC CTT GAG CAAAC T AAT CAC GAA 144 6 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GACAC T T CAT T T C C CAGT AC AC C AGAAG C T GTAAGA 660 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I II I I I I I I II I I III 
Db 1447 AAAGATAGT GAGAGT AGTAAT GAT GAT ACTT CT T T C C C CAGT AC GC C AGAAGGT AT AAAG 1506 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTG AGAAT GTT 717 

I I I M I I I II I II I I I I I I I I II I I I I I I I I I I I II MINIMI II 
Db 1507 GAT C GT T CAG GAGC AT AT AT CAC AT GT GCTCCCTT T AAC C C AGCAGCAAC T GAGAGC AT T 1566 

Qy 718 T C AAC AAAC AT TTTTCCCTTGTTG GAAG AT CAT AC T T C G GAAAAT AAG AC AG AT G AAAAA 777 

I I I M I I M I I I II I I I II II I II II I I II II I II I I II II I II M I I I I I I I 
Db 1567 G CAACAAAC AT TTTTCCTTT GT T AGGAGAT C CT AC TT C AGAAAAT AAGAC C GAT GAAAAA 1626 



Qy 77 8 AAGATAGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGACAT CA 836 

II I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I III I I I I I I 
Db 1627 AAAAT AGAAGAAAAGAAGGC C CAAAT AGT AAC AGAGAAGAATAC TAGC AC CAAAACAT C A 168 6 

Qy 837 AAC CCTTTCCT TAT G G C AG C AC AG GAGT C T AAG ACAGATT AC GT T AC AAC AGAT CAT GT G 8 96 

I I I I I M I I II I I I I I I II I I I I I I I I I I I II I I I I I II I I II I 

Db 168 7 AAC CCTTTTCTT GT AG C AG C AC AG GAT T C T GAGAC AG AT TAT GT C AC AAC AGAT AAT T T A 17 4 6 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
Db 1747 ACAAAG GT GACT GAG GAAGT C GT G GCAAAC AT G C C T GAAG GC C T GAC T C CAGAT T T AGT A 1806 

Qy 957 C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGCT ACT GGT ACAAAAAT T G C C TT T GAAACA 1016 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1807 C AG GAAGCAT GT GAAAGT GAAT T GAAT GAAGT TACT GGT ACAAAGAT T GC T TAT GAAACA 18 66 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II MM I II I II 
Db 1867 AAAAT G GAC T T GGT T CAAAC AT C AGAAGTT AT G CAAGAGT CACT CT AT C CT G C AGCAC AG 192 6 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I M M I I I II I II I I II I I II II I I I I I II I I I II I I II I I M II II I I I I I I I 
Db 1927 CT T T GC C CAT CAT T T GAAGAGT CAGAAG CT ACT C C T T C AC CAGT T T T GC CT GAC AT T GT T 198 6 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 119 6 

I II I II I M II I II I II I I I I I II I I II I M M I I II II II I I I I I II III 
Db 1987 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 04 6 

Qy 1197 T CAT C AC CATT AGAAACT CT T C CT T CAGT TAAT TAT GAAAGCAT AAAGTT T GAGC C T GAA 1256 

I II II I II I II I II I I I I I II I M II 1 I I II I II II I II I I I II I II I I I I I 

Db 2047 T CAT CAC CATT AGAAGC T T CT T CAGTT AAT TAT GAAAGC ATAAAAC AT GAG C CT GAA 2103 

Qy 1257 AAT C C C C CACC AT AT GAG GAGG CC AT GAAT GT AT CACT AAAAAAAGAAT C AGGAAT GAAT 1316 

II II I II II I II I I I I I I I I I I I I II II I I M I I II I I I I I II I I I I I M I I II 

Db 2104 AACCCCCCACCATATGAAGAGGCCAT GAGT GT AT CACTAAAAAAAGT AT CAGGAATAAAG 2163 

Qy 1317 GAAGAAAT CACAGAGC C T GAAG GT ATT AGT GT AG CT GT T CAGGAAAC AGAAGC T C CT T AT 1376 

I M II I I I I II I I I I I I I I I I I II II I I I I I II I II II I I II II I I I II I II 
Db 2164 GAAGAAAT TAAAGAGC CT GAAAAT AT TAAT G C AGC T CT T CAAGAAAC AGAAG CT C C T TAT 2223 

Qy 1377 AT AT C T AT T G CAT GT GAT T TAAT T AAAG AAAC AAAGAT C T C T AC T GAAC C GAC T C CAGAT 1436 

I II II II I II II II II I II I I II I I II II I I I II I I I III I II I I II I I I I III 

Db 2224 AT AT CT AT T GC AT GT GAT T TAAT T AAAGAAACAAAGC T T T CT GCT GAAC C AG CT C C G GAT 2283 

Qy 1437 T T C T C T AGT TAT T CAGAAAT AGC AGAAGT T G CAC AGC CAGT GC C C GAG CAT T C T GAGC T A 14 96 

II I I I I I I I II II II I M III I I I I I I II II II II I I I I II II II II I I II I I 

Db 2284 T T C T CT GAT TAT T CAGAAAT G G CAAAAGT T GAAC AGC CAGT G C CT GAT CAT T CT GAG CT A 2343 

Qy 1497 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

I II I II I I II I II I II I II II I II II I I I I II II I I I M I I I II I I II II II I I II I 

Db 2344 GTT GAAGATT C CT CAC CT GATT CT GAAC CAGTT GACTTAT TTAGT GAT GATT GAAT AC CT 2403 

Qy 1557 GAAGT T C CACAAAAACAAGAT GAAG CT GT AAT ACT T GT GAAAGAAAAC C T CAC T GAAATT 1616 

II I I I I I II I I I II I II II I I I I I II I II I I I II I II I I I I I I I I I I I I I I I 

Db 24 04 GAC GTT C CACAAAAACAAGAT GAAACT GT GAT GCT T GT GAAAGAAAGT CT CACT GAGACT 24 63 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCT T CAC CAT CAC CT 1676 



Db 24 64 T CAT TT GAGT CAAT GATAGAATAT GAAAATAAGGAAAAACT CAGT GCTT T GC CACCT 252 0 

Qy 1677 GAG G GAG GAAAAC C GT AT T T GGAGT CT T T T C AG C C CAGT T TAG G CAT C AC AAAAGAT AC C 1736 

I I I I I I II I I I II I I I I I I I I I I I I I I III I I I II I I I I I II I I II I I I I I I 
Db 2521 GAGGGAGGAAAGC C AT AT T T GGAAT CT T T T AAG CT C AGT T TAGATAACAC AAAAGAT AC C 258 0 

Qy 1737 T TAG CAC C T GAT GAAGT T T CAG CAT T GAC C C AAAAG GAGAAAAT C C C T T T G C AGAT G GAG 1796 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I 
Db 2581 CT GT T AC CT GAT GAAGT T T CAAC AT T GAGCAAAAAGGAGAAAAT T C CT T T G C AGAT GGAG 2640 

Qy 1797 GAG C T CAAT ACT GCAGTT TAT T CAAGT GAT GG CT T AT T CAT T GCT C AGGAAG C AAAC C T A 1856 

I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I III II I I I I II I I I II 
Db 2 641 GAGC T CAGT ACT GC AGT T TAT T CAAAT GAT GACT TAT T TAT T T CT AAG GAAGC AC AGAT A 2700 

Qy 1857 AGAGAAAGT GAAAC AT T T T CAG AT T CAT C T C C GAT T GAGAT TAT AG AT GAGT T C C C GAC C 1916 

I I I I I I I II I I I I II II I I I I I I I I I M I I MM I I I I I I I I I I I I I I I I I || 

Db 27 01 AGAGAAACT GAAAC GTTTT CAGATT CAT CT C CAAT T GAAAT TAT AGAT GAGTT CCCT ACA 2760 

Qy 1917 T T T GT CAGT T CTAAAGC AGAT TCTTCTCC T AC AT T AGC CAGGGAAT AC AC T GAC C TAGAA 1976 

II I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2761 T T GAT CAGT T CT AAAACTGAT T CAT T T T C T AAAT TAG C CAGGGAAT AT ACT GAC CT AGAA 2820 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

Ml I I I I I I M I I I I I I I I I I I I I II I I I I II I I I I I I I I I M I I I I I I II 
Db 2821 GT AT CC C ACAAAAGT GAAAT T GCTAAT G C C CC GGAT G G AGC T G GGT CAT T G CCT T GCAC A 2880 

Qy 2 037 G GAT T G C C C CAT GAC CTTTCTTT C AAGAGT AT AC AAC C T AAAGAGGAAGT T CAT 2090 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 2 881 GAAT T GC C C CAT GAC CT TT CT T T GAAGAAC AT ACAAC C C AAAGTT GAAGAGAAAAT CAGT 2 94 0 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I. I I I I I 
Db 2941 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 300 0 

Qy 2151 GAT GTT T CT GC TT T GGAT GCT CAAG CAGAGAT AGGCAG C AT AGAAAAAC C CAAAGT T CT T 2210 

I I I I I I I I I I I I I I II I I II I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 3001 GAT GTTTCTGCTTTGGC C ACT CAAGCAGAGATAGAGAGCAT AGT T AAAC C CAAAGT T CT T 3060 

Qy 2211 GT GAAAGAAGCC GAGAGAAAACTT C CTT CT GATACAGAAAAAGAGC GAAGAT CT C CAT CT 2270 

I I II I I I II II I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I 
Db 3061 GT GAAAGAAGCT GAGAAAAAACTT CCTT CCGAT ACAGAAAAAGAGGACAGAT CAC CAT CT 312 0 

Qy 2271 GCT AT AT T T T CAGC AGAGC T GAGT AAAAC T T CAGT T GT T GAC C T C CT C TACT G GAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I II I II I I I I I I II II I I 
Db 3121 GCT AT AT T T T CAG C AGAGCT GAGTAAAAC T T C AGT T GTT GAC CT C C T GT AC T GGAGAGAC 3180 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I II I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3181 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3240 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 50 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3241 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGAC CAT CAGC 3300 

Qy 2451 T T TAG G ATAT AT AAGGGT GT GAT C CAG GC TAT C C AGAAAT CT GAT GAAG G C CACC C AT T C 2510 

I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 



Db 3301 T T T AGGAT AT ACAAGGGT GT GAT C C AAG CT AT C CAGAAAT C AGAT GAAGGC C AC CCAT T C 3360 

Qy 2511 AG GG CAT AT T T G GAAT C T GAAGT T GC T AT AT CT GAGG AGT T GGT T C AGAAGT AC AG C AAT 2570 

I I II I I I I I I I I I II I 1 II I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I II I I III 
Db 3361 AGG GCAT AT C T G GAAT CT GAAGT T G CT AT AT C T GAG GAGT T G GT T C AGAAGT AC AGT AAT 3420 

Qy 2571 TCTGCTCTTGGT CAT GT T AACT G CACAAT AAAAGAACT CAGAC GCCTCTTCT T AGT T GAT 2630 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I 
Db 3421 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3480 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2 690 

I I II I II I I II I I II I I I I I I II I I I I I I I I I I I II II I II I I I I I I I I II II I I I I I I I 
Db 34 81 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 354 0 

Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 3541 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3600 

Qy 2751 AT T TAT GAAC G G CAT C AGGC GCAAAT AG AT CAT TAT CT GG GAC T T GCAAATAAGAAT GT T 2810 

I I I II I I I I I II I I I I II I I II II I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I 
Db 3601 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3660 

Qy 2 811 AAAGAT G C TAT GGC TAAAAT C CAAGCAAAAAT C C CT G GAT T GAAGC GTAAAGCT GAAT GA 2 87 0 

I I II I I II I II I I I I I I I I I I I I I I I I II II I I I I I I I I I I II I I I I I I I I II I I I I I I 
Db 3661 AAAGAT GCT AT G GC TAAAAT C CAAGCAAAAAT C C C T GGAT T GAAGC GCAAAGCT GAAT GA 372 0 

Qy 2871 GAAAGC CT GAAAGAGT TAAC AAT AGAGGAGTT T AT CT T T AAAG GG GAT AT T C ATT T GAT T 2930 

Mill I I I I I I I I I I I I I I I I I I I i I I I I II I I I II M I I I I I I I I 

Db 3721 AAAC G C C C AAAAT AAT T AGT AG GAGT T CAT CT T T AAAG G GGAT AT T CAT T T GAT T 3775 

Qy 2931 C CAT TGG GG AG G GT C AGGGAAGAACAAAGC CT T GACAT T G CAGT G C AGT T T C AC AGAT CT 2990 

II I I I I I 

Db 3776 AT AC GGAT CT 37 8 5 

Qy 29 91 TTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCA 304 9 

I I I I I II I I I I I III II I II II II I I M I I I I I I I I I I I I I II I I MINIM 

Db 3786 T T AT TTT T AGC CAT G C ACT GT T GT GAGGAAAAAT T AC CT GT CT T GAC T G C CAT GT GTT C A 3845 

Qy 3050 T CAT CT T AAGT AT T GT AAGC T GCT AT GT AT GGAT T T AAAT C GT AAT CAT AT TTGTTTTTC 3109 

II I II II M II I M I I I M II I I II I II I II II I II I II I II I II M II I I I II I I 

Db 3846 T CAT CT T AAGT AT T GT AAG CT GC T AT GT AT GGAT T T AAAC C GT AAT CAT A — TCTTTTTC 3903 

Qy 3110 CT GT AT GAGGC ACT GGT GAATAAACAAAGAT C T GAGAAAG C T GT AT AT T AC ACT T T GT C G 3169 

II I II II II I II I I II I I I I II M I I II II I I I I II I I I 

Db 3904 CTATCTGAGGCACTGGTGGA AT AAAAAAC CT GT AT AT T T T AC T T T GT T G 3952 

Qy 3170 CAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAA 3222 

II I II I II II I I I II II M I II M I II I M II I I II I I I I I 

Db 3953 C AGAT AGT CTT G C C GCAT C T T G G C AAGT T G C AGAGAT G GT G GAGCT AGAAAAAAAAAAAA 4 012 

Qy 3223 TAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTG 32 82 

II I M I I I II II II I I I II II II II I I I II II I I II II II I I II II II M 
Db 4013 AAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTG 4072 

Qy 3283 AAAT G AAA T GT T T AGAC GAGAT CAT G C C AC C AAG G C AG GAGT GAAAAAG C T T G C C T 3338 

I II I I Ml I M II I II II I II I || I M II I I M I I II I I II II I I I M 

Db 4073 AAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C G GT AAAG C AG GAAT GACAAAG C T T G- C T 4131 



Qy 3339 T T C C T GGT AT GT T CT AGGT GT AT T GT GAAAT T TACT GT T GT AT T AAT T G C CAAT AT AAGT 3398 

II I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I II I II I I II | I I I 

Db 4132 TTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGT 4191 

Qy 33 99 AAATAT AGATT AT AT AT AT CT AT AT AT AGT GT T T CAC GAAGCT TAG C C C T T T AC C T T C C C 3458 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II 

Db 4192 AAATAT AGAT TAT AT AT G TAT AGT GT T T C AC AAAG C T T AGAC CT T T AC C T T - C C 4244 

Qy 3459 AGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGTGTGTAGTCCCA 3512 

III I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 42 4 5 AG C CAC C C C ACAGT G CT T GAT AT T T C AGAGT CAGT CAT T GGT T AT ACAT GT GT AGT T C C A 4304 

Qy 3513 AAGCACATAAGCTAGGGAGAAAC GT ACTT CTAGGCGCACTACCAT CT GT TTT CAACAC GA 3572 

I I I I I I I I I I I I I I I I I II I I I II M I I I I I I I I I I I I I I I I I I I I I I II 

Db 4 305 AAGCACATAAGCTAGAAGAAGAAATATTTCTAGGAGCACTACCAT CT GTTTTCAACATGA 43 64 

Qy 3573 AC C GAC GC CAT GCAAAC AGAAC T C C - T CAAC AT AAACT T C ACT GCAC AGAC T TACT GT AG 3631 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I j I I I I I I I I I I I I 
Db 4365 A AT GC CACACACATAGAACT C CAACAACAT CAATT T CATT GCACAGACT GACT GT AG 4421 

Qy 3632 T T AAT T T TAT CAC — AAACT C T GGAC T GAAT CTAAT GCT T C CAAAAA TGTTTG 3682 

I I I I I II I I I I I I I I I II I I I I I I I II I I I I I I I I I I I II I I I I I I I I 

Db 4422 TT AAT TTT GT C ACAGAAT C TAT G GACT GAAT CTAAT GC T T C CAAAAAT GT T GT T T GT T T G 44 81 

Qy 3683 C AAATAT CAAACAT T GT TAT GT AAGAAAAT AT AAAT GAC GAT T TAT AC AAT T 3734 

II I I I I I I I I I I I I I I I I I I I MINI III I I I II I I I I I I I I I I III 

Db 4 4 82 C AAAT AT CAAACAT T GT TAT GCAAGAAAT TAT T AATT ACAAAAT GAAGAT T TAT AC CAT T 4541 

Qy 3735 GT G GT T TAAGC T GT AT T GAACT AAAT C T GT GGAAT GC AT T GT GAACT GTAAAAG CAAAGT 3794 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I II I I I II I I I I I I I I I I I II I I I 
Db 4542 GT G GT T T AAG CT GT ACT GAACTAAAT CT GT G GAAT G C ATT GT GAACT GT AAAAGCAAAGT 4601 

Qy 3795 AT CAATAAAGCTTAT AGACTTAAAAAAAAAA 3825 

I I I I I I II I I I I I I II I I I I I I II I I I I I I I 
Db 4 602 AT CAATAAAGCTTAT AGACTTAAAAAAAAAA 4 632 
XX

PT New purified disease detection and treatment molecule proteins and 
XX

PS Claim 1; SEQ ID NO 124; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule

CC polypeptide (MDDT) which has anti-inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic, 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotide 

CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia, psoriasis or 

CC hepatitis. ABX34440-ABX34835 encode the MDDT polypeptides represented in

CC ABU11450-ABU11845, described in the disclosure of the invention, NOTE: 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences

XX 

SQ Sequence 4698 BP; 1410 A; 1028 C; 1022 G; 1238 T; 0 U; 0 Other; 



Query Match 67.0%; Score 2566.4; DB 7; Length 4698;



Best Local Similarity 84.7%; Pred. No. 0; 

Matches 3221; Conservative 0; Mismatches 441; Indels 140; Gaps 25; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I II I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I II II 

Db 856 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 915 

Qy 61 GT ACT G C C C ACT GAAGGAAC AC T T C C AG- CAAC T T CAAAT GAAG C TT C T AAAGC AT T C T C 119 

III I I I I I I I I I I I I I I I I I II I II II I I I I I I I II I I I I I I I I II I 

Db 916 GT AT T AC C C AC T GAAG GAAC ACT T CAAGAAAAAT GT C AGT GAAGCTT CT AAAGAG GT C T C 975 

Qy 120 AGAGAAGGCAAAAAAT C CATTT GT AGAGAGAAATTTAACAGAATTTT CAGAAT T GGAAT A 179 

I I I I I I I I I I I I II II I I II I I III I II I I II I I I I I I I I I I I I I I II I I I 

Db 976 AGAGAAGGCAAAAAC T CT AC T C AT AGAT AGAGAT T T AAC AGAGT T T T CAGAAT T AGAAT A 1035 

Qy 180 TT CAGAAAT GGAAT CAT CAT TCAGT GGCT CT CAAAAGGCAGAACCTGC CGTAACAGT AGC 239 

I I I I I I I I I I I I I I I I I I I I I I I Mill III I I II I I I I I I I I I I I I I I I I I 
Db 1036 CT CAGAAAT GGGAT CAT C GT T C AGT GT C T C T C C AAAAG CAGAAT C T G C C GTAAT AGT AG C 1095 

Qy 24 0 GAAT C CT AGGGAC GAAATAGTT GT GAGGAGTAGAGATAAAGAAGAGGACTT AGTT AGT CT 299 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I 

Db 1096 AAAT C C TAG G GAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AA 1155 

Qy 300 TAACAT C CT T CAT AC T CAG C AGGAGT TAT C T AC AGT C CT T AC GAAAT C AGT T GAAGAAGA 359 

II I I I II I I I I I I I III II I I I I I I I I I I I I I I I I I II I I III I I I I II 

Db 1156 TAACAT C CT T CAT AAT CAACAAGAGT T AC C T AC AGCT CTT AC T AAAT T GGT T AAAGAGG A 1215 

Qy 360 AGAT AGAGT T CT GT CT C C AGAAAAAAC AAAG GACAGT T TTAAG GAAAAG GGAGT T GC AG C 419 

II MM Mill II I I I II I MM I I II II II I I I I I I I I I II I I II II I 

Db 1216 TGA AGT TGTGTCTT C AGAAAAAG CAAAAG AC AGT T T T AAT GAAAAGAGAGT T GC AGT 1272 

Qy 420 AGAAG CT T C TAT GGG G GAG GAAT AT GC AGACT T CAAAC CATT T GAGC GAGT AT GGGAAGT 479 

I II I II II II I I II I I I II I I I II II I II II I II I I II I II II II II II I I I II I II 
Db 1273 G GAAGCT C CT AT GAGG GAGGAAT AT GCAGAC T T CAAAC CAT T T GAGC GAGT AT GGGAAGT 1332 

Qy 480 GAAAGAT ACTT ACAAGCAAGATAGT GAT GTTTT GATT GCT GGAGGTAAT ATAGAGAGCAA 53 9 

II II I I I I I III II I I II I I I II I III I II I I I I I I I I I II II I I I I II 

Db 1333 GAAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCTG GAGGTAAAAT C GAGAGC AA 138 9 

Qy 540 AT T G GAAG GTAAAGT GGATAAGAAAC AC T T T T C AGAT AGC CT T GAACAAACAAAT C GT GA 599 

. II I I I I I I I II I I II II II Ml III II I I I I I II I II I I I I I I II II II 
Db 1390 CT T G GAAAGT AAAGT GGATAAAAAAT GT T TT GC AGAT AGC CT T GAGC AAAC T AAT CAC GA 1449 

Qy 600 AAAAGATAGT GAAAGCAGTAAT GAT GACACTT CATTT C CCAGTACAC CAGAAGCT GTAAG 659 

I I I I I I I I I I I I II I I I I I I II I M I I I I I II I II I II I I II II I I I I III 
Db 1450 AAAAGATAGT GAGAGTAGTAAT GAT GAT ACTT CTTT CC CC AGT AC GC CAGAAGGT ATAAA 1509 

Qy 660 AGGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAAC AAC T GAGAAT GT 716 

I I I I I I Mill II Mill I I I I I I I I I I II II II II II II I II I I I 
Db 1510 G GAT C GT T CAGGAG CAT AT AT CAC AT GTGCTCCCTT T AAC C C AGCAG CAACT GAG AGC AT 1569 

Qy 717 T T C AACAAACAT TTTTCCCTTGTTG GAAGAT CAT AC T T C GGAAAATAAGAC AGAT GAAAA 776 

I I I M I II I I II I II I I I I I I I I Mill I I I I I I I I II II II II I I I I I I I I I 
Db 1570 T G C AACAAACAT TTTTCCTTTGT T AGGAGAT C C T ACT T CAGAAAATAAGAC C GAT GAAAA 1629 

Qy 777 AAAGAT AGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGACAT C 8 35 

III II II II I II I II I II II I II II I I I I I II I I I I II I II Ml II I II 



Db 1630 AAAAAT AGAAGAAAAGAAGGC C CAAATAGTAACAGAGAAGAAT ACTAGC AC CAAAACAT C 1689 

Qy 836 AAACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGT 8 95 

I I I I I I II I III I II I I I I I I II I Ml I I M I I I I I I II I M I I II II I I I 
Db 1690 AAAC C C T T T T CTT GT AG C AG C ACAGGAT T C T G AGACAGAT TAT GT C ACAACAG ATAAT T T 1749 

Qy 8 96 GTCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGT 955 

I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I II 
Db 1750 AACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGT 18 0 9 

Qy 956 T C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAG CT ACT GGT ACAAAAAT T G C CT T T GAAAC 1015 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1810 AC AGGAAGC AT GT GAAAGT GAAT T GAAT GAAGT T ACT GGT ACAAAGAT T GC T TAT GAAAC 18 69 

Qy 1016 AAAAAT GGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACA 1075 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I II I II MM I MM 

Db 1870 AAAAAT GGACTTGGTT CAAACAT CAGAAGTT AT GCAAGAGT C ACT C TAT C CT GCAGCAC A 1929 

Qy 107 6 GCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGT 1135 

II M II II M II II II II II M I II M M II II I II II II II II II I M II I I II 

Db 193 0 GCTTTGCC CAT CAT T T GAAGAGT C AGAAGCT ACT C CT T CAC C AGT T T T GC CT GAC AT T GT 198 9 

Qy 1136 CATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAG 1195 

II I II II II II II I II I II II II I M II II I II II II I II II I I II II III 

Db 1990 TAT G GAAGC AC CAT T GAAT T CT GCAGT T C CT AGT GCT GGT G CT T C C GT GAT AC AGC C C AG 2049 

Qy 1196 T T CAT CAC CAT T AGAAACT CTT C CT T CAGTT AAT TAT GAAAGCATAAAGT T T GAGC CT GA 1255 

II I I I II I I I M I M I I II I I II II II II I II I II I I II I II I II II II I I 
Db 2 050 C T CAT CAC CAT T AGAAG C T T C T T CAGT TAAT TAT GAAAG CAT AAAAC AT GAGC CT G A 2106 

Qy 12 56 AAAT C C C C CAC CAT AT GAG GAG G C CAT GAAT G T AT CAC T AAAAAAAGAAT C AG GAAT GAA 1315 

III I II I II II II II I I I I I II II I M I II I II II II I II II M I II I II I M M 

Db 2107 AAAC C C C C CAC CAT AT GAAGAG G C CAT GAGT GT AT CAC TAAAAAAAGT AT C AGGAAT AAA 2166 

Qy 1316 T GAAGAAAT C ACAGAGC CT GAAGGT AT T AGT GT AG CT GT T C AGGAAACAGAAG CT C CT T A 1375 

II II I I I I I I I II I I I I I I Mill II MM II I I II II I I II I II II II II 
Db 2167 GGAAGAAAT TAAAGAG C CT GAAAAT AT TAAT GC AG CT CTT CAAGAAACAGAAGCT C CTT A 2226 

Qy 137 6 TAT AT C TAT T G CAT GT GAT T TAAT T AAAGAAACAAAGAT CT C TACT GAAC C GACT C C AGA 14 35 

II II I M I I II I I II II I M M II II M II I II II II I III II II II I I I II II 
Db 2227 TAT AT C TAT T G CAT GT GAT T TAAT T AAAGAAACAAAGC TT T CT G C T GAAC C AGC T C C G GA 2286 

Qy 1436 T T T C T C T AGT TAT T C AGAAAT AG C AGAAGT T GC AC AGC CAGT G C C C GAG CAT T CT GAG CT 1495 

II II I I I II I I II M I II I III II II I I II II I I II I I II II II I I I I II I I I 
Db 2287 T T T C T CT GAT TAT T C AGAAAT GG CAAAAGT T GAAC AGC CAGT GC CT GAT CAT T CT GAG C T 2346 

Qy 14 96 AGT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GACT TAT T T AGT GAT GATT CAAT AC C 1555 

II II II II I M M M II I M II II I II II M II I I I II II II II I II II I II II II M 

Db 2347 AGT T GAAGAT T CC T CAC CT GAT T CT GAAC CAGT T GAC T TAT T T AGT GAT GATT CAAT AC C 2406 

Qy 1556 C GAAGT T C C ACAAAAACAAGAT GAAG C T GT AAT AC T T GT GAAAGAAAAC CT CAC T GAAAT 1615 

M II II I II M II II I II M I M II II M I I II I II II I I I I II I I I II I I 
Db 24 07 T GAC GT T C C ACAAAAACAAGAT GAAACT GT GAT GC T T GT GAAAGAAAGT C T CACT GAGAC 24 66 

Qy 1616 T T CAT CT GAGT CAAT GACAG GACAT GAC AAT AAGG GAAAAC T CAGT G CT T CAC CAT CAC C 1675 

Mill II III II M I I II I MM II II I II I II M II II II II I I MM 

Db 24 67 T T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAAC T CAGT G CT T T GC CACC 2523 



Qy 167 6 T GAGGGAG GAAAACC GT AT T T GGAGT C T T T T C AGC C C AGT T T AG GC AT CACAAAAGAT AC 1735 

I I I II I I II I I I II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 
Db 2524 T GAG G GAG GAAAGC CAT AT T T G GAAT CT T T T AAG C T CAGT T T AGAT AACAC AAAAGAT AC 2583 

Qy 1736 C T TAG C AC CT GAT GAAGT T T CAG CAT T GAC C CAAAAG G AGAAAAT C C CT T T GC AGAT G GA 17 95 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 258 4 C C T GT T AC C T GAT GAAGT T T CAACAT T GAGCAAAAAG G AGAAAAT T C C T T T GC AGAT GGA 2643 

Qy 17 96 GGAGCT CAAT AC T GC AGT T TAT T C AAGT GAT G GCT T AT T CAT T GC T CAG GAAG CAAAC CT 1855 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I 
Db 2644 GGAGCT CAGT AC T GC AGT T TAT T C AAAT GAT GACT TAT T TAT T T CTAAGGAAG C ACAGAT 27 03 

Qy 185 6 AAGAGAAAGT GAAACATTTT CAGATT CAT CT C CGAT T GAGATT AT AGAT GAGTT C CCGAC 1915 

I I I I I I I I I I I I I I I I I I II I II I I I I I II I Mill I I I I I I I I I I I I I I I I I II 
Db 27 04 AAGAGAAACT GAAAC GT T T T CAGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C CT AC 2763 

Qy 1916 C T T T GT CAGT T CTAAAGC AGAT T CT T CT C CT AC AT TAG C CAG GGAAT ACAC T GAC C T AGA 1975 

II I I I I I I I I II I I I I I I I I I II I I I I I I II I I I I II I I I I I I I II I I I I 
Db 2764 AT T GAT CAGT T C T AAAACT GAT T CAT T TT CT AAAT TAG C CAG GGAAT AT ACT GAC C T AGA 2823 

Qy 1976 AGTAG C CCACAAAAGT GAAATT G CT GACAT C C AGGAT G GAGC T GG GT CAT T GG CT T GT GC 2035 

I I I I I M I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I 

Db 2 824 AGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCAC 2 8 83 

Qy 2 036 AGGAT T GC CC CAT GAC CTTTCTTTCAAGAGT AT ACAACC T AAAGAGGAAGT T C A 2 0 89 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 884 AGAAT T GC C C CAT GAC CT T T CT T T GAAGAAC AT ACAAC C CAAAGT T GAAG AGAAAAT CAG 294 3 

Qy 2090 T GT C C CAGAT GAGT TCTCCAAAGATAGGGGT GAT GTTTCAAAGGTGCCCGT ACT GCCTCC 214 9 

I I I I I I I I I I I I I I I I I I I I I III I I I I II I I I I I I I I M I I I I 
Db 2944 TTTCT CAGAT GACTTTTCTAAAAATGGGTCTGCTACATCAAAGGT GCT CTT ATT GCCTCC 3003 

Qy 2150 AGAT GTTTCTGCTTT GGAT GCT C AAG C AGAGAT AG GC AGC AT AGAAAAAC C CAAAGT T CT 22 09 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3004 AGAT GTTTCTGCTTT GGC CACT CAAG C AGAGAT AGAGAGC AT AGT TAAAC C CAAAGTT CT 3063 

Qy 2210 TGTGAAAGAAGCCGAGAGAAAACTT CCTT CT GATACAGAAAAAGAGCGAAGAT CTCCATC 2269

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3064 T GT GAAAGAAG CT GAGAAAAAAC TTCCTTCC GAT ACAGAAAAAGAG GACAGAT C AC CAT C 3123 

Qy 22 7 0 T GCT AT ATT T T CAGCAGAGCT GAGT AAAACT T CAGT T GTT GAC CT C CT CT ACT GGAGAGA 2 32 9 

I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 3124 T GCT AT ATT TT CAGCAGAGCT GAGT AAAACT T CAGTT GTTGAC CT C CT GT ACT GGAGAGA 3183 

Qy 2330 CATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGT 238 9 

I I I I I I M I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I II I I I I I M 
Db 3184 CAT T AAGAAGAC T GGAGT GGT GT T T G GT G C C AGC CTAT T C C T GC T GCT T T CAT T GAC AGT 324 3 

Qy 2390 ATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAG 24 4 9 

I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3244 ATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAG 3303 

Qy 2450 CTT TAG GAT AT AT AAGGGT GT GAT C CAG G CT AT C CAGAAAT CT GAT GAAG G C C AC C C ATT 2509 

I I I I I I I I I I II I I I I 1 II I I I I I I I II I I I I I I I I I I II I I I II I I I I I I I M I I I 
Db 3304 CTT TAG GAT AT ACAAG GGT GT GAT C CAAG CTAT C CAGAAAT CAGAT GAAG G C C AC C C ATT 3363 



Qy 2510 C AGG GCAT AT T T G GAAT C T GAAGT T GC T AT AT C T GAGGAGT T GGT T C AGAAGT AC AGCAA 2569 

I M I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I M I I I I I I I I I I I I I I I I I 11 
Db 3364 CAGG G CAT AT C T G GAAT C T GAAGT T GCT AT AT C T GAGGAGT T G GT T C AGAAGT AC AGTAA 3423 

Qy 2570 TTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGA 2 629 

I I I I I I I I I I I I II II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3424 TTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGA 34 83 

Qy 2 630 TGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGC 2 68 9 

I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34 84 T GAT T T AGT T GAT T CT C T GAAGT T T G C AGT GT T GAT GT GG GT AT T T ACC T AT GT T G GT GC 3543 

Qy 2690 CTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGT 2749 

I I I I I I II I I I II I I I I I I I II I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I 
Db 3544 CTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGT 3603 

Qy 2750 TAT T TAT GAAC GG C AT C AG G CGCAAAT AG AT CAT TAT C T G G GACT T G C AAATAAGAAT GT 28 09 

I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I M I I II I I I I I I I I I I I I I I I I 
Db 3604 TAT T TAT GAAC GG CAT CAGG C AC AGAT AGAT CAT TAT CT AGGACT T GC AAATAAGAAT GT 3663 

Qy 2 810 T AAAGAT GCT AT GG CT AAAAT C CAAGC AAAAAT C C CT G GAT T GAAG C GT AAAGCT GAAT G 28 69 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 3664 T AAAGAT G CT AT GG CT AAAAT C CAAGC AAAAAT C C CT GGAT T GAAG C GCAAAG CT GAAT G 3723 

Qy 2 870 AGAAAG C CT GAAAGAGTT AACAAT AGAG GAGT T T AT CT TT AAAGG GGAT AT T CAT T T GAT 2929 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 3724 AAAACGCCCAAAATAATTA GT AG GAGT T CAT CTTT AAAGG GGAT AT TC AT TTGA- 3777 

Qy 2 930 T C C ATT G G GGAGGGT C AGG GAAGAACAAAG C C T T GACAT T G CAGT GC AGT T T C AC AGAT C 2 989 

II I I I I I I 

Db 3778 T TAT AC G GAT C 3788 

Qy 2990 TTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTC 304 8 

I I II I II I I II I I III II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M 
Db 3789 T T TAT T T T T AGC CAT G C AC T GT T GT GAG GAAAAAT TAC CT GT CT T GACT GC C AT GT GT T C 38 4 8 

Qy 3049 AT CAT CT TAAGT AT T GTAAGCT GCT AT GT AT G GAT T T AAAT C GTAAT CAT AT T T GT TT T T 3108 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 3849 AT CAT CT TAAGT AT T GTAAGCT GCT AT GT AT G GAT T T AAAC C GTAAT CAT A — TCTTTTT 3906 

Qy 3109 C C T GT AT GAG G C ACT GGT GAAT AAACAAAG AT C T GAGAAAG CT GT AT AT TAC AC T T T GT C 3168 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3907 CCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTTGTT 3955 

Qy 3169 GCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT GACAG 3219 

I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 3956 GCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAG 4015 

Qy 3220 AAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTT 3279 

III II I II I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 016 AAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTT 4 075 

Qy 3280 CT GAAAT GAAA T GT T T AGAC GAGAT CAT GC CAC CAAG GCAG GAGT GAAAAAG CT T G 3335 

I II I II I I I I I I I I I I II I I I I II I I I I I I II I I I I I I III I I I I I I M 

Db 4076 C T GAAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C GGTAAAGCAG GAAT GACAAAG CT T G 4135 



Qy 3336 CCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATA 3395



1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

Db 4136 -CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATA 4194 

Qy 3396 AGT AAAT ATAGAT TAT AT AT AT C TAT AT AT AGT GT T T C AC GAAGCTT AG C C C T T T AC CT T 3455 

I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4195 AGTAAATATAGATTATA TAT GT AT AGT GT T T C AC AAAG C T T AGAC CT T T AC C T T 4248 

Qy 3456 CCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGTGTGTAGTC 3509 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I 

Db 42 4 9 - C C AG C C AC C C C ACAGT G C T T GAT AT T T C AGAGT CAGT CAT T GGT TAT AC AT GT GT AGT T 4307 

Qy 3510 C C AAAG C ACAT AAGCT AG G GAGAAAC GT AC T T C T AG GC GC ACT AC C AT CT GT T T T CAAC A 3569 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 43 08 C CAAAG C ACAT AAGCT AGAAGAAGAAAT AT T T C TAG GAGC ACT AC CAT C T GT T T T CAAC A 4367 

Qy 3570 C GAAC C GAC G C CAT GCAAAC AGAACT C C - T CAACATAAACT T CAC T GCAC AGACT TAC T G 362 8 

III Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4368 TGAA AT G C CACACAC AT AGAACT C C AACAAC AT CAAT T T CAT T GCAC AGACT GACT G 4424 

Qy 3629 T AGT T AAT TT T AT CAC — AAACTCTGGACTGAATCTAATGCTTCCAAAAA TGT 367 9 

I I I I I I I I I I I I I I I II II I I I I I I II I I ! I I I I I I I I I I I I I I I Ml 
Db 4425 T AGT TAAT TT T GT CACAGAAT CT AT GGAC T GAAT CT AAT GCT T CCAAAAAT GT T GT T T GT 4 4 84 

Qy 3680 T T GCAAAT AT CAAACATT GTT AT GTAAGAAAATAT AAAT GAC GAT T TAT AC A 3731 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I III I I I I I I I I I I I I I I I 

Db 44 85 TT GCAAAT AT CAAACATT GTT AT GCAAGAAAT TAT TAAT TACAAAAT GAAGATTT AT ACC 4544 

Qy 3732 ATTGTGGTTTAAGCTGTATTGA 3753 

I I I I I I I I I I I I I I I I I I III 
Db 4545 ATTGTGGTTTAAGCTGTACTGA 4 566 
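
The "Best Local Similarity" figure in the summary block of each result appears to be the share of aligned columns that are exact matches, i.e. Matches / (Matches + Conservative + Mismatches + Indels). This definition is an assumption inferred from the reported counts, not something the report states. A minimal sketch that checks it against the counts given for the 4698 BP hit above (the function name is hypothetical):

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of alignment columns that are exact matches (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the summary line of the 4698 BP hit:
# Matches 3221; Conservative 0; Mismatches 441; Indels 140
print(round(best_local_similarity(3221, 0, 441, 140), 1))
```

Under this assumption the result rounds to the 84.7% shown in the report.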



RESULT 5 
AAS09453 

ID AAS09453 standard; cDNA; 4053 BP. 
XX 

AC AAS09453; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human cDNA encoding the Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis;

KW Marchiafava-Bignami disease; pontine myelinolysis; adrenoleukodystrophy;

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 

KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe's disease; AB020693; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 135..3713

FT /*tag= a 

FT /product= "Nogo protein"



XX 
XX
XX

PF 12-JAN-2001; 2001WO-US001041.
XX 

PR 12-JAN-2000; 2000US-0175707P.

PR 26-MAY-2000; 2000US-0207366P.

PR 29-SEP-2000; 2000US-0236378P.
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR P-PSDB; AAU09453. 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders.
XX 

PS Example 1; Page 95-100; 109pp; English. 
XX 

CC The sequence (Genbank accession number AB020693) encodes the human Nogo

CC protein, a 250kDa myelin-associated axon growth inhibitor. The invention 

CC relates to the use of the nogo receptor, nogo protein, their nucleic 

CC acids, vectors expressing them and antibodies against them, to isolate 

CC agents which block nogo receptor mediated axonal growth. The agent is 

CC useful for treating a central nervous system disorder which is a result 

CC of cranial or cerebral trauma, spinal cord injury, stroke or a 

CC demyelinating disease selected from multiple sclerosis, monophasis 

CC demyelination, encephalomyelitis, multifocal leukoencephalopathy, 

CC panencephalitis, Marchiafava-Bignami disease, pontine myelinolysis,

CC adrenoleukodystrophy, Pelizaeus-Merzbacher disease, Spongy degeneration, 

CC Alexander's disease, Canavan's disease, metachromatic leukodystrophy, 

CC viral infection and Krabbe's disease

XX 

SQ Sequence 4053 BP; 1189 A; 922 C; 922 G; 1020 T; 0 U; 0 Other; 

Query Match 62.1%; Score 2379.4; DB 4; Length 4053; 
Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 



Db

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120

Db 906 GT ATTAC C CACTGAAGGAACACTT CAAGAAAAT GT CAGTGAAGCT T CTAAAGAGGT CT CA 965

Qy 121 GAGAAG G C AAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T T T CAGAAT T G GAAT AT 180

Db 966 GAGAAG G C AAAAACT CT AC T C AT AGAT AG AGAT T T AAC AG AGT T T T C AGAAT T AGAAT AC 1025

Qy 181 T C AGAAAT GGAAT CAT CAT T C AGT G G CT C T CAAAAGG C AGAAC C T G C C GT AAC AGT AG C G 240

Db 1026 T CAGAAAT GG GAT CAT C GT T CAGT GT CT CT C CAAAAG C AGAAT CT G C C GT AAT AGT AG C A 1085

Qy 241 AAT C CT AG GGAC GAAAT AGT T GT GAGGAGT AGAGATAAAGAAGAG GACT T AGTT AGT C T T 300 

I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I 
Db 10 86 AAT CCTAGGGAAGAAATAAT CGT GAAAAATAAAGAT GAAGAAGAGAAGT T AGTTAGT AAT 1145 

Qy 301 AACAT C CT T CAT ACT C AG C AGGAGT TAT C T AC AGT C CT T AC GAAAT CAGT T GAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I III I I I I II 

Db 1146 AACAT C CT T CAT AAT C AACAAGAGT T AC C T AC AGCT CT T AC T AAAT T GGT T AAAGAGGAT 1205 

Qy 361 GAT AGAGT TCTGTCTC C AGAAAAAAC AAAG GACAGTT T T AAG GAAAAGGGAGT T G CAG C A 420 

II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 12 06 GA AGT T GT GT CT T CAGAAAAAG C AAAAGAC AGT T T T AAT GAAAAG AG AGTT GCAGT G 1262 

Qy 421 GAAGCTT C TAT G G GGG AGGAAT AT G C AGACT T CAAAC CAT T T GAG C GAGT AT G GGAAGT G 4 80 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1263 GAAGCT C C TAT GAGGGAGGAAT AT G C AGACT T CAAAC CAT T T GAGC GAGT AT G GGAAGT G 1322 

Qy 481 AAAGAT AC T T ACAAGCAAGAT AGT GAT GT T T T GAT T GCT GGAG GT AATAT AGAGAGC AAA 54 0 

I I M I I I I III I I I I I I I I I I I I III I I I I 1 I I I I I I I II II I I I I II 
Db 1323 AAAGATAGT AAGGAAGAT AGT GAT AT GT T GGC T GCT GGAG GTAAAAT C GAGAGC AAC 137 9 

Qy 541 TT GGAAGGTAAAGT GGATAAGAAACACT T TT CAGATAGCCTT GAACAAACAAAT C GTGAA 600 

I I I I I I II I I I I I I II I I I III III I I I M I I I I I I I I M II I I I I I III 
Db 138 0 TT GGAAAGTAAAGT GGATAAAAAAT GTT T T GCAGATAGCCT T GAGCAAACTAAT CACGAA 1439 

Qy 601 AAAGATAGT GAAAGCAGTAAT GAT GACACTT CATTTCCCAGT ACAC CAGAAGCT GTAAGA 660 

I I I I II I I I I I II I I I I I I I I I II I I I 1 I II I I I I I I I I I I I I I I I I M I 
Db 144 0 AAAGATAGT GAGAGTAGT AAT GAT GAT AC TTCTTTCCC CAGT AC G C C AGAAG GT AT AAAG 149 9 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AACAAC T GAGAAT GT T 717 

I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I II 

Db 15 00 GAT C GTT CAGGAGCAT AT AT C AC AT GT GCTCCCTT TAAC C CAG C AGC AAC T GAGAG CAT T 155 9 

Qy 718 T CAACAAAC ATT T TTCCCTTGTT GGAAGAT C AT ACTT C GGAAAAT AAGAC AGAT GAAAAA 777 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1560 GCAACAAAC ATT T T T C CT T T GT T AGGAGAT C C T ACTT CAGAAAAT AAGAC C GAT GAAAAA 1619 

Qy 77 8 AAGAT AGAA- AAAAAAAGGC AC AAAT T GTAAC AGAGAAGAAT G CAAGT GT CAAGAC AT C A 8 36 

II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 

Db 1620 AAAAT AGAAGAAAAGAAGGC C CAAAT AGT AAC AGAGAAGAAT ACT AG C AC CAAAACAT C A 167 9 

Qy 8 37 AAC C C TT T C C TT AT G GC AGC AC AGGAGT CT AAGACAGAT T AC GT T AC AAC AGAT CAT GT G 8 96 

M I I I I I I Ml I I I I I I I I I I I I III I I I I I I I I I I II I I II I I I I I II I 
Db 1680 AACCCTTTT CTT GTAGCAGCACAGGATT CT GAGACAGATTAT GT CACAAC AGATAATTT A 1739 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I M 
Db 174 0 AC AAAG GT GACT G AGGAAGT CGT G GCAAAC AT G C C T GAAG G C CT GACT C C AGAT T T AGT A 17 99 

Qy 957 CAG GAAGCAT GT GAAAGT GAAT T GAAT GAAGCT AC T GGT ACAAAAAT T GC C T TT GAAAC A 1016 

I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1800 CAGGAAG CAT GT GAAAGT GAAT T GAAT GAAGT T AC T GGT ACAAAGAT T GCT TAT GAAAC A 1859 

Qy 1017 AAAAT G GAC CT GGT T CAAACT T C AGAAG CT GT G CAGGAGT C ACT T T AC C CT GT AACACAG 1076 

I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I II I I I II I I I I I I M I I 



Db 1860 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1919 



Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1920 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1979 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 1980 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2039 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 040 T CAT C AC CAT T AGAAGC T T CT T C AGT T AAT TAT GAAAGC AT AAAAC AT GAG C CT G AA 2096 

Qy 1257 AAT C C C C C AC CAT AT GAG GAG G C CAT GAAT GT AT C AC T AAAAAAAGAAT C AG G AAT GAAT 1316 

II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 

Db 2097 AAC C C C C C AC CAT AT GAAGAGG C CAT GAGT GT AT C ACT AAAAAAAGT AT C AG G AAT AAAG 2156 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 137 6 

II I I I I I I I I I I I II I I I I Mill II I I I I I I I I II II I I I I I I I I I I I I I I 
Db 2157 GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 2216 

Qy 1377 AT AT CT AT T GC AT GT GAT T T AAT T AAAGAAAC AAAGAT CT CT AC T GAAC C GAC T C C AGAT 143 6 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I III I I I I I I I I I I I III 
Db 2217 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2276 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 1496 

I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2277 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 2336 

Qy 1497 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

I I I I I 11 I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2337 GT T GAAG AT T C C T C AC C T GAT T C T GAAC C AG T T GAC T TAT T T AGT GAT GAT T C AAT AC C T 2396 

Qy 1557 GAAGTT CCACAAAAACAAGAT GAAGCT GT AAT ACT T GT GAAAGAAAACCT CACT GAAATT 1616 

II I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I M I I I I 

Db 2397 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2456 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAAT AAGGGAAAACT CAGT GCTT CAC CAT CAC CT 167 6 

I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 

Db 2457 T CAT T T GAG T CAAT G AT AGAAT AT GAAAAT AAG GAAAAAC T CAGT G C T T T G C CACCT 2513 

Qy 1677 GAG G GAGGAAAAC C GT AT T T G GAGT C T T T T C AG C C CAGT T TAG G CAT C AC AAAAGAT AC C 1736 

II I I I II I I I I II I I I II I I I I I I I I I III I I I I I I II I I II I I I I II I II I 

Db 2 514 G AG GG AGGAAAG C CAT AT T T G GAAT CT T T T AAG CT CAGT T T AGAT AAC AC AAAAGAT AC C 2573 

Qy 1737 T TAG CAC C T GAT GAAGT T T C AG CAT T GAC C C AAAAG G AGAAAAT C C C T T T G C AG AT G GAG 1796 

I I I II I I II I I II II I I I II I I I I II I I I I I I M I II I I II II II I I I I M I 
Db 257 4 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2 633 

Qy 1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

II I I I II I I I I I I I I I I II I II I I I I I I I I I I I I I Ml II I I I II I I I I M 
Db 2 634 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2 693 

Qy 1857 AGAGAAAGT GAAAC AT T T T C AGAT T CAT CT C C GAT T GAG AT TAT AG AT GAGT T C C C GAC C 1916 

I I II I II I II I I I I II I II I I II I I I I II I II I I I I I II I I II II I I I I I M II 
Db 2694 AGAG AAACT GAAAC GT T T T C AG AT T CAT CT C CAAT T G AAAT TAT AGAT GAGT T C C CT AC A 2753 



Qy 1917 T T T GT C AGT T C T AAAG C AG AT T CT T C T C CT AC AT T AGC CAGG GAAT AC AC T GAC C T AGAA 1976 

II I II II I I II I I I I I I I I I I III I I I I I I I I I M I I I I I 

Db 2754 T T GAT C AGT T C T AAAACT GAT T CAT T T T C T AAAT T AGC CAGG GAAT AT AC T GAC C TAGAA 2813 

Qy 1977 GT AGC C C ACAAAAGT GAAAT T GCT GAC AT CC AGGAT GGAGCT GGGT CATT GGCTT GT GC A 2 036 

III I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II Ill II 

Db 2 814 GT AT C C C ACAAAAGT GAAAT T GCT AAT G C C C C GGAT GGAG CT GGGT CATT G C CT T G C AC A 2873 

Qy 2037 GGAT T GC C C CAT GAC CT T T CT T T C AAGAGT AT ACAAC C TAAAGAGGAAGT T CAT 2090 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 287 4 GAAT T GC C C CAT GAC CT T T CT T T GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT C AGT 2933 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I M 
Db 2934 TTCTCAGATGACTTTTCTAAAAAT GGGT CTGCTACATCAAAGGT GCT CTTATTGCCTCCA 2993 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT C AAG C AGAGAT AG GCAG CAT AGAAAAACC CAAAGT T C T T 2210 

I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2994 GAT GT TTCTGCTTTGGC C ACT CAAGCAGAGAT AGAGAG CAT AGT TAAACC CAAAGT T C T T 3053 

Qy 2211 GT GAAAGAAGC CGAGAGAAAACTT CCTT CT GATACAGAAAAAGAGCGAAGAT CT CCAT CT 22 7 0 

I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I II I I I M MINI 
Db 3054 GT GAAAGAAGCT GAGAAAAAACTT CCTT CC GAT ACAGAAAAAGAGGACAGAT CACCAT CT 3113 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3114 GC T AT AT T T T CAGCAGAGCT GAGTAAAACT T CAGT T GT T GAC C T C CT GT ACT G GAGAG AC 317 3 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 23 90 

I I I I I I I I I I I I I I I ! I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I II I I 
Db 317 4 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3233 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I M I I II II I I I I I I 
Db 3234 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3293 

Qy 2451 TTT AGGAT ATATAAGGGTGT GAT C CAGGCT AT C CAGAAAT CT GAT GAAGGC CACC CATT C 2510 

I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 3294 T T T AGGAT AT ACAAGG GT GT GAT C CAAGC T AT C CAGAAAT C AGAT GAAGG C C AC C CAT T C 3353 

Qy 2511 AG G GC AT AT T T GGAAT CT GAAGT T G C T AT AT CT GAG GAGT T GGT T C AGAAGT AC AG C AAT 2570 

I I II I I I I I I I I I I I II I I I II II I II I I M I I I I I I I I I I I I I I I I I I I I I I I I Ml 

Db 3354 AG G G CAT AT C T GGAAT CT GAAGT T GC T AT AT CT GAG GAGT T GGT T C AGAAGT AC AGTAAT 3413 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

II I I I I I I II I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M II 

Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I M 
Db 3474 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I 
Db 3534 TTGTTTAAT GGT CTGACACTACTGATTTTGGCTCTCATTTCACTCTT CAGT GTTCCTGTT 3593 



Qy 2751 AT T TAT GAAC GG C AT C AGGC G CAAAT AGAT CAT TAT C T GG GAC TT GCAAAT AAGAAT GT T 2810 

| I | | I M I I I M 1 I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I M II I I I I I I I I 

Db 3594 ATTTAT GAAC GGCATCAGGCACAGAT AGAT CATT AT CTAGGACTT GCAAATAAGAATGTT 3653 

Qy 2811 AAAG AT G C TAT G G C T AAAAT C C AAG C AAAAAT C C C T G GAT T G AAG C G T AAAG C T G AAT G A 287 0 

| | | | I | | | I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I 1 I I 

Db 3654 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3713 

Qy 2 871 GAAAGC C T GAAAGAGT T AACAAT AGAGGAGTT T AT C T T T AAAGGG GAT AT T CAT T T GAT T 2930 

M I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I M II I I 

Db 3714 AAAC G C C C AAAAT AAT T A GT AG GAGTT C AT CT T T AAAGGGGAT AT T CAT T T GAT T 3768 

Qy 2931 C CAT T GGG G AG G GT C AG GGAAGAACAAAG C CT T GAC AT T GC AGT GC AGT T T C AC 2984 

I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3769 AT AC GG G G G AGG GT C AG GGAAGAAC GAA- C CT T GAC GT T GC AGT GC AGT T T C AC AGAT C G 3827 

Qy 2985 AGAT CT T TAT T T TT AGC AAC G C AGT G- T CT GAGGAAAAAT GAC CT GT C T T GAC T G 303 8 

I I I I I I I I I I I I I I I I I I III II I I I M I I II I I I I I I I I I I I I I I I II 

Db 3828 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3887 

Qy 3039 CCCTGTGTT CAT CAT CT TAAGT AT T GT AAG CT G C TAT GT AT GGAT T TAAAT C GT AAT CAT 3098 

|| I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 3888 C CAT GT GT T CAT CAT C T TAAGT AT T GT AAG CT G C TAT GT AT GGAT T TAAAC C GT AAT CAT 394 7 

Qy 3099 ATT T GTTTTTCCT GT AT GAGG C ACT G GT GAAT AAAC AAAGAT CT GAGAAAG CT GT AT AT T 3158 

I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3948 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC CT GT AT AT T 3994 

Qy 3159 AC ACT T T GT C G C AG GT AGT CT T G C T GT AT - TT G G GGAAT T G CAAAGAAAGT GGAGC T 3214 

I I I I I I I II I I I I I I I I I I I I II I I I I I II I I I III I I I I I I I I 
Db 3995 T T AC T T T GT T GC AGAT AGT CT T G C C GC AT CT T GG CAAGT T G CAGAGAT GGT GGAG CT 4051 



RESULT 6 
ACC81048 

ID ACC81048 standard; cDNA; 4053 BP. 
XX 

AC ACC81048; 
XX 

DT 22-JUL-2003 (first entry) 
XX 

DE Human NogoA gene. 
XX 

KW Human; Nogo receptor; NgR; CTS domain; neuroprotective; gene therapy; 

KW axonal growth; central nervous system; CNS; Nogo; spinal cord injury; 

KW cranial trauma; cerebral trauma; spinal trauma; stroke; Krabbe's disease; 

KW demyelinating disease; multiple sclerosis; monophasic demyelination; 

KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis; gene; 

KW ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 135..3713 

FT /*tag= a 

FT /product= "Human NogoA" 

XX 



PN WO2003031462-A2. 
XX 

PD 17-APR-2003. 
XX 

PF 04-OCT-2002; 2002WO-US032007. 
XX 

PR 06-OCT-2001; 2001US-00972599. 
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2003-393433/37. 

DR P-PSDB; ABR59667. 
XX 

PT New human Nogo receptor polypeptides and nucleic acids, useful for 

PT decreasing inhibition of axonal growth by a central nervous system 

PT neuron, or in treating central nervous system disease, disorder or 

PT injury, e.g. spinal cord injury. 
XX 

PS Disclosure; Page 126-131; 148pp; English. 
XX 

CC The invention relates to a novel nucleic acid encoding a polypeptide 

CC comprising amino acid residues 27-309 of a 473 amino acid sequence (PI, 

CC human Nogo receptor (NgR) NTLRRCT domain), or residues 27-309 of PI with 

CC 1-20 conservative amino acid substitutions, and less than a complete CTS 

CC domain, provided that a partial CTS domain, if present, consists of no 

CC more than the first 39 consecutive residues. The nucleic acid of the 

CC invention has neuroprotective activity. The polynucleotide may have a use 

CC in gene therapy. The nucleic acid is useful for decreasing inhibition of 

CC axonal growth by a central nervous system (CNS) neuron. The NgR 

CC polypeptide or an agent that inhibits the binding of Nogo to NgR or 

CC NgR-dependent signal transduction in the central nervous system neuron may be 

CC used in treating central nervous system disease, disorder or injury, e.g. 

CC spinal cord injury. Expression of an NgR protein may be associated with 

CC inhibition of axonal regeneration following cranial, cerebral or spinal 

CC trauma, stroke or a demyelinating disease, such as multiple sclerosis, 

CC monophasic demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, or Krabbe's disease. The present 

CC sequence is used in the exemplification of the invention. 

XX 

SQ Sequence 4053 BP; 1189 A; 922 C; 922 G; 1020 T; 0 U; 0 Other; 

Query Match 62.1%; Score 2379.4; DB 8; Length 4053; 

Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 
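
The "Best Local Similarity" figure can be cross-checked against the match counts reported with it. The sketch below assumes the percentage is matches divided by the total number of aligned columns (matches + conservative + mismatches + indels). That formula is inferred from the numbers in this report, not stated by it, and the function name `local_similarity` is hypothetical.

```python
def local_similarity(matches: int, mismatches: int, indels: int,
                     conservative: int = 0) -> float:
    """Percent identity over all aligned columns (assumed definition)."""
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts reported for the ACC81048 alignment:
# Matches 2802; Conservative 0; Mismatches 381; Indels 54
acc81048 = local_similarity(matches=2802, mismatches=381, indels=54)
print(f"ACC81048: {acc81048:.1f}%")
```

Under this assumption the computed value rounds to the 86.6% printed in the report.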



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

Db 906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 
M M II I M I I I I M I I I I I I Ml I I I I I I I I I I I M I I I I I I I 



Db 966 GAGAAG GCAAAAACT C TACT C AT AGAT AGAGAT T T AAC AGAGT T T T C AGAAT T AGAAT AC 1025 

Qy 181 T CAGAAAT G GAAT CAT CAT T C AGT G GC T C T C AAAAGGC AGAAC CT GC C G T AAC AGT AG C G 24 0 

I I I I II I I I I I I I I I I Mi I I II I III I I I I I I I I I II I I I I II I I I I 
Db 102 6 T CAGAAAT GGGAT CAT C GTT CAGT GT CT CT C CAAAAGC AGAAT CT GC C GTAAT AGTAGCA 1085 

Qy 241 AAT C C TAG G GAC GAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAGGAC T T AGT T AGT CT T 300 

I I I I I I I I I I I MINI I I I I I I II I I I I I I I I I I I I I I I I II I I I I I 
Db 108 6 AAT C CT AG G GAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AAT 114 5 

Qy 301 AAC AT C CT T CAT ACT C AG CAG GAGT T AT CT AC AGT C CT T AC GAAAT CAGT T GAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I III I II I II 

Db 1146 AAC AT C CT T CAT AAT CAACAAGAGT T AC C T AC AG CT CT T ACTAAATT G GT T AAAGAGGAT 1205 

Qy 361 GAT AGAGT T CT GT CT C C AGAAAAAACAAAG GAC AGT T T T AAG GAAAAG G GAGT T GCAGC A 420 

II I I I I I I I I I MINIM I I I I II I M II I II I I II M I I I I I I II II 

Db 1206 GA AGT T GT GT CT T C AGAAAAAGCAAAAGAC AGT TT TAAT GAAAAGAGAGT T GC AGT G 1262 

Qy 421 GAAGC T T C TAT GGGG GAG GAAT AT G C AGACT T CAAAC CAT T T GAG CG AGT AT GGGAAGT G 48 0 

I I I I I I I I I I I M I I I I II I I I I I II I I I I I I II II I I I M I I I I I I I I I II I I I M I 
Db 1263 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1322 

Qy 481 AAAGAT ACT TACAAGCAAGAT AGT GAT GT T T T GAT T G C T G GAG GTAAT AT AGAGAGC AAA 540 

M I I II I I III I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I II I I 
Db 1323 AAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCTG GAGGT AAAAT C GAGAGC AAC 137 9 

Qy 541 T T GGAAGGT AAAGT GGAT AAGAAACACT T T T CAGAT AGC CT T GAACAAACAAAT C GT GAA 600 

I II I I I II M II M I II I I III III I I M I I I I II I II I I I I I I II I III 

Db 1380 T T GGAAAGT AAAGT GGAT AAAAAAT GT T T T GCAGAT AGC CT T GAG CAAACT AAT CAC GAA 1439 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GACACTT CAT T T C C C AGT ACAC C AGAAGCT GTAAGA 660 

II I I I I I I II I II I I I I II I I I II I I I I I II I I I I I I I I I II I I I I I III 

Db 144 0 AAAGATAGT GAGAGT AGT AAT GAT GAT ACTT CT T T C C C C AGT AC GC C AGAAG GT AT AAAG 1499 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AAC T G AGAAT GTT 717 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II II 

Db 1500 GAT C GT T C AGGAG C AT ATAT CAC AT GTGCTCCCTT TAACC C AGC AGCAAC T GAGAGC AT T 1559 

Qy 718 T C AAC AAAC AT TTTTCCCTTGTT GGAAG AT CAT AC T T C GGAAAAT AAGAC AGAT GAAAAA 777 

I I I I I I II I I I I M I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I M I II 
Db 1560 G C AAC AAAC AT TTTTCCTTTGT TAG GAGAT C C T AC T T C AGAAAAT AAG AC C GAT GAAAAA 1619 

Qy 77 8 AAGATAGAA- AAAAAAAG GC AC AAAT T GT AAC AGAGAAGAAT GCAAGT GT CAAG ACAT CA 83 6 

M I I I II I I I I I I I I I I I I I I I M I I II I I II I I I I I I II III I I I I II 
Db 1620 AAAAT AGAAGAAAAGAAG GC C C AAAT AG TAAC AGAGAAGAAT ACT AG CAC CAAAACAT CA 1679 

Qy 837 AAC C CT T T C CT T AT G GCAGC AC AGGAGT C T AAGACAGATT AC GT T ACAAC AGAT CAT GT G 896 

II I I I I I I I I I I I I I I I I I I I I I Ml I II I I I II I I II I II II I I I I II I 

Db 168 0 AAC C CT T T T CT T GT AGC AGC AC AGGAT T CT GAGACAGATT AT GT CACAAC AGAT AAT T T A 1739 

Qy 897 T CAAAG GT GAC C GAG GAAGT AGT GG CAAAC AT G C C T GAAGGT C TAAC C C CAGAT TT GGT T 956 

I II I I I I M I I I I I I I I I I I I I I I I I I I M I I M M I I II II I I I II I I I M 
Db 174 0 ACAAAGGT GAC T GAG GAAGT C GT GG CAAAC AT G C C T GAAGGC C T GACT C C AGAT TT AGT A 1799 

Qy 957 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAG C T AC T GGT ACAAAAAT T G C C T T T GAAAC A 1016 

I I M I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1800 C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGT T AC T GGT AC AAAGAT T GC T TAT GAAAC A 1859 



Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 107 6 

I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1860 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1919 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I II I I I I I I I I II II I I I 
Db 192 0 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1979 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN 
Db 1980 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2039 

Qy 1197 T CAT CAC CAT T AGAAACT C T T C CT T C AGT T AAT TAT GAAAGC AT AAAGT T T GAGCC T GAA 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 

Db 2 04 0 T CAT CAC CAT T AGAAG C TT C T T CAGT T AAT TAT GAAAG CAT AAAAC AT GAG C CT GAA 2 0 96 

Qy 1257 AAT C CC CCAC CAT AT GAGGAGGCCAT GAAT GT AT CACTAAAAAAAGAAT CAGGAAT GAAT 1316 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 20 97 AACCCCC CAC CAT AT GAAGAGGCC ATGAGT GTAT CACTAAAAAAAGT AT CAGGAAT AAAG 2156 

Qy 1317 GAAGAAAT C ACAGAGC CT GAAGGT AT T AGT GT AG C T GT T C AG GAAAC AGAAGC T CC T TAT 137 6 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2157 GAAGAAAT T AAAGAG C CT GAAAAT AT T AAT GCAG CT CT T CAAGAAAC AGAAGCT C CT T AT 2216 

Qy 1377 ATATCT ATT GCAT GT GATTTAATTAAAGAAACAAAGAT CT CTACT GAACCGACT CCAGAT 14 36 

I I I I I I II I I I I I I I I II I II I I I I I I I I I I I II II I III I I I I I I I I I I I Ml 
Db 2217 AT AT CT AT T GCAT GT GAT T TAAT T AAAGAAACAAAG CT TT CT GCT GAAC C AGCT C C GGAT 2276 

Qy 1437 T T CT C T AGT TAT T CAGAAAT AG CAGAAGT T G CAC AGC CAGT G C C C GAGC AT T C T GAGC T A 14 96 

I I I I I I I I I I I I I I I II I III I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I 
Db 2277 T T CT CT GAT TAT T CAGAAAT GGCAAAAGT T GAAC AGC CAGT G C CT GAT CAT T CT GAG CT A 2336 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GACTT AT T T AGT GAT GAT T CAAT AC C C 1556 

I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I II I I II I I I II I I I I I 

Db 2337 GT T GAAGAT T CC T CAC CT GAT T CT GAAC C AGT T GACT T AT T T AGT GAT GAT T CAAT AC CT 2396 

Qy 1557 GAAGTT CCACAAAAACAAGAT GAAGCT GTAATACT T GT GAAAGAAAACCT CACT GAAAT T 1616 

II I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 2397 GACGT T C CACAAAAACAAG AT GAAACT GT GAT GC T T GT GAAAGAAAGT CT C AC T GAGACT 2456 

Qy 1617 T CAT CT GAGT CAAT GACAG GAC AT GACAATAAG GGAAAACT CAGT GC T T CAC CAT CAC C T 1676 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 2457 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAG GAAAAACT C AGT GC T T T GC CACCT 2513 

Qy 1677 GAGGGAGGAAAAC C GT AT T T G GAGT CT T T T C AGC C CAGT T T AGGCAT C ACAAAAGAT AC C 1736 

I I I I I I I I I I I II I II I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 
Db 2514 GAGG GAGGAAAGC CAT AT T T G GAAT C T T T TAAG C T CAGT T TAG ATAACACAAAAGAT AC C 2573 

Qy 1737 T TAG CAC CT GAT GAAGT T T C AG CAT T GAC C CAAAAGGAGAAAAT C C C T T T GC AGAT GGAG 1796 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2574 C T GT T AC CT GAT GAAGT T T CAAC AT T GAG CAAAAAGGAGAAAAT T C CT T T GC AGAT GGAG 2633 

Qy 1797 GAG CT CAAT AC T G CAGT T TAT T CAAGT GAT G GCT TAT T CATT GC T C AG GAAG CAAACC T A 1856 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I II 
Db 2 634 GAG CT CAGT AC T G CAGT T TAT T CAAAT GAT GACT TAT T TAT T T C TAAGGAAG CAC AGAT A 2 693 



Qy 1857 AG AGAAAGT GAAAC AT T T T C AGAT T CAT C T C C GAT T G AGAT T AT AGAT G AGT T C C C GAC C 1916 

I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2694 AGAGAAAC T GAAAC GT T T T C AG AT T CAT C T C C AAT T GAAAT TAT AGAT GAG T T C C C T AC A 2753 

Qy 1917 T T T GT C AGT T CT AAAGCAGAT T C T T CT C CT AC AT TAG C CAGGGAAT ACAC T GAC CTAGAA 197 6 

II M I I II I II I I I Mill I I III I I I I I I I I I I I I I I I I I I I I I II I II I 

Db 2 7 54 T T GAT CAGT T CT AAAACT GAT T CAT T T T C T AAAT TAG C CAGG GAAT AT AC T GAC C T AGAA 2 813 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

Ml I I I I I II I I I I II I I I I I I I I II I I I I I I I I I I I I 1 I I I I I I I I II I I 
Db 2814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2 873 

Qy 2037 G GAT T G C C C CAT GAC CTTTCTTT C AAG AGT AT AC AAC C TAAAGAGGAAGTTCAT 2 0 90 

I I M I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 2874 GAAT T GC C C CAT GAC CTTTCTTT GAAGAAC AT AC AAC C C AAAGT T GAAGAG AAAAT CAGT 2 933 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I I I I I I I II I I II I I I I I III I II I I I I I I I I II I I I I I I I I 
Db 2934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2993 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG C AGAGAT AGG CAGCAT AGAAAAAC C CAAAGT T CT T 2210 

I I I II I I I I I I I II I I I I II I II I I I I I II I I II I I I I I I I I I I I I I I II I I 1 
Db 2994 GAT GTTTCTGCTTTGGC C AC T CAAGC AGAGAT AGAGAGCAT AGT T AAAC C CAAAGT T CT T 3053 

Qy 2211 GT GAAAGAAGCCGAGAGAAAACTT C CTTCT GATACAGAAAAAGAGCGAAGAT CT C CAT CT 2270 

I I I I I II I I I I I I I I I I I I I I I II II I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 3054 GT GAAAGAAGCT GAGAAAAAAC TT C C T T C C GAT ACAGAAAAAGAGGAC AGAT C AC C AT CT 3113 

Qy 2271 GC T AT AT T T T C AGC AGAGCT GAGT AAAAC T T CAGT T GTT GAC CT C CT CT AC T GGAGAGAC 2330 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 3114 G CT ATAT T T T C AGC AGAGC T GAGTAAAACT T CAGT T GTT GAC CT C C T GT AC T GGAGAGAC 3173 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I II I M I I I M I I I I I I I I I I II I I I II I II I I I I I 
Db 3174 AT T AAGAAGACT G GAGT GGT GT TT G GT GC C AG C CT AT TCCTGCTGCTTT CAT T GACAGT A 3233 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3234 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 32 93 

Qy 2451 T T TAGGAT AT AT AAGG GT GT GAT C CAGGC T AT CC AGAAAT C T GAT GAAG G C CAC C CAT T C 2 510 

I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I 
Db 3294 T T TAG GAT AT ACAAGGGTGT GAT C CAAGC TAT C C AGAAAT C AGAT GAAG GC CAC C CAT T C 3353 

Qy 2511 AGG G CAT AT T T G GAAT C T GAAGT T G C TAT AT C T GAG GAGT T GGT T CAGAAGT AC AGC AAT 2570 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I III 
Db 3354 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3413 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I I I I I I I I I I I I II I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 34 73 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2 690 

I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 3474 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533 

Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 



Db 3534 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3593 

Qy 2751 AT T TAT GAAC G G CAT C AG G C GC AAAT AGAT CAT TAT CT G G GACT T GCAAATAAGAAT GT T 2810 

I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 3594 AT T TAT GAAC GG CAT CAG G CAC AG AT AGAT CAT TAT CT AG GACT T G CAAAT AAGAAT GT T 3653 

Qy 2 811 AAAGAT G CT AT G G C T AAAAT C C AAGC AAAAAT C C C T G GATT GAAG C GT AAAG C T GAAT G A 2 87 0 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I II I I I I I I I I 

Db 3654 AAAGAT G C TAT G GCT AAAAT C CAAG CAAAAAT C C C T GG ATT GAAGC G CAAAGCT GAAT GA 3713 

Qy 2 871 GAAAGC CT GAAAGAGT T AACAAT AGAG GAGT T TAT CT T T AAAG G GGAT AT T CAT T T GAT T 293 0 

II I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I I 

Db 3714 AAAC GC C CAAAAT AAT T A GT AGGAGT T CAT C TT TAAAG G GGAT AT T CAT T T GAT T 3768 

Qy 2931 C CAT T G GGGAGGGT CAG GGAAGAACAAAGC C T T GACAT T GCAGT GC AGT T T CAC 2984 

I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I II I II 
Db 3769 AT AC G GGG GAG G GT C AGGGAAGAAC GAA- C CT T GAC GT T GCAGT G C AGT T T CAC AGAT C G 3827 

Qy 2985 AGAT CT T TAT T T TT AGCAAC G C AGT G - T CT GAGGAAAAAT GACC T GT CT T GAC T G 3038 

I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 382 8 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 38 87 

Qy 3039 C C CT GT GT T CAT C AT CT TAAGT AT T GT AAGCT G CTAT GT AT G GAT T TAAAT C GT AAT CAT 3098 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I 

Db 3888 C CAT GT GT T CAT C AT CT TAAGT AT T GTAAG CT GCT AT GT AT GGAT T TAAAC C GTAAT CAT 394 7 

Qy 3099 AT T T GT T T T T C CT GT AT GAGGC ACT G GT GAAT AAACAAAGAT C T GAGAAAGCT GT AT ATT 3158 

I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I II II 

Db 394 8 A — TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAAC C T GT AT AT T 3994 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I II I I I I I II I I II I I I I I I II I II I I I II I I III I I I I I I I I 
Db 3995 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4051 

ID AAA23454 standard; cDNA; 4093 BP. 
XX 

AC AAA23454; 
XX 

DT 19-JUN-2000 (first entry) 
XX 

DE cDNA encoding human secreted protein vb22_1, SEQ ID NO: 63. 
XX 

KW Human; secreted protein; cancer; tumour; cardiovascular disorder; 

KW blood disorder; haemophilia; autoimmune disease; diabetes; inflammation; 
OS Homo sapiens. 
FH Key Location/Qualifiers 
FT /product= "Clone vb22_1 ORF2" 
FT /product= "Human secreted protein vb22_1" 
XX 

PA (ALPH-) ALPHAGENE INC. 
XX 

PI Valenzuela D, Yuan O, Hoffman H, Hall J, Rapiejko P; 
XX 

DR WPI; 2000-224657/19. 

DR P-PSDB; AAY95012, AAY95030. 

XX 

PT New secreted or transmembrane proteins and polynucleotides encoding them, 

PT useful for treating neurodegenerative disorders, autoimmune diseases and 

PT cancer. 
XX 

PS Claim 72; Page 321-322; 357pp; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94981-Y95020), 

CC and cDNA sequences encoding them (AAA23423-A23462). The secreted proteins 

CC of the invention include those that are thought to be only partially 

CC secreted, i.e., transmembrane proteins. The proteins of the invention may 

CC exhibit one or more activities selected from the following: cytokine 

CC activity; cell proliferation; differentiation; immune modulation; 

CC haematopoiesis regulation; tissue growth activity; activin/inhibin 

CC activity; chemotactic/chemokinetic activity; haemostatic and thrombolytic 

CC activity; anti-inflammatory activity; and tumour inhibition activity. The 

CC proteins may be administered to patients as vaccines, and the nucleotides 

CC may be used as part of a gene therapy regime. Diseases or conditions that 

CC may be treated using the proteins or nucleotides of the invention include 

CC autoimmune diseases; genetic disorders; haemophilia; cardiovascular 

CC diseases; cancer; bacterial, fungal and viral infections, especially HIV; 

CC multiple sclerosis; rheumatoid arthritis; pulmonary inflammation; 

CC Guillain-Barre syndrome; insulin dependent diabetes mellitus; and 

CC allergic reactions such as asthma and anaemia. They may also be used for 

CC treating wounds, burns, ulcers, osteoporosis, osteoarthritis, periodontal 

CC diseases, Alzheimer's disease, Parkinson's disease, Huntington's disease 

CC and amyotrophic lateral sclerosis (ALS). Proteins with activin/inhibin 

CC activity may additionally be useful as contraceptives. Nucleic acid 

CC sequences of the invention may be used in chromosome mapping, and as a 

CC source of diagnostic primers and probes. The present sequence represents 

CC cDNA encoding one of the 40 proteins of the invention 



SQ Sequence 4093 BP; 1213 A; 926 C; 928 G; 1026 T; 0 U; 0 Other; 



Query Match 61.8%; Score 2370.4; DB 3; Length 4093; 

Best Local Similarity 86.4%; Pred. No. 0; 

Matches 2807; Conservative 0; Mismatches 386; Indels 55; Gaps 15; 
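
The summary figures above can be cross-checked against each other. The sketch below assumes that "Best Local Similarity" is the number of matches divided by the total of matches, conservative substitutions, mismatches and indels. The listing does not state this definition, but it is consistent with the values reported here. The function names `parse_stats` and `local_similarity` are illustrative only.

```python
import re


def parse_stats(line):
    """Parse 'Name value;' pairs from an alignment summary line into ints."""
    return {name: int(value)
            for name, value in re.findall(r"([A-Za-z]+)\s+(\d+);", line)}


def local_similarity(stats):
    """Percent of aligned positions that are exact matches.

    Assumption: aligned positions = matches + conservative
    + mismatches + indels (not stated in the report itself).
    """
    aligned = (stats["Matches"] + stats["Conservative"]
               + stats["Mismatches"] + stats["Indels"])
    return 100.0 * stats["Matches"] / aligned


line = "Matches 2807; Conservative 0; Mismatches 386; Indels 55; Gaps 15;"
stats = parse_stats(line)
print(round(local_similarity(stats), 1))  # compare with the reported 86.4%
```

Under the same assumption, the 2519 matches, 339 mismatches and 22 indels reported for the 3579 BP entries give 87.5%, which also agrees with the listing.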

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I II I I I I I I I I I I I I I I Mil I I I I I I I I I I I I I I I I I I I I I II I II I I 

Db 863 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 922 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

III I I I I I I I I I I 1 I I I II I I I I II II I I I I I I I I I I I I I I I I I I I I I 

Db 923 GT ATTACC CACT GAAGGAACACTT CAAGAAAAT GT CAGT GAAGCTT CTAAAGAGGT CT CA 982 

Qy 121 GAGAAGGCAAAAAAT C CATTT GTAGAGAGAAATTTAACAGAATTT T CAGAAT T GGAAT AT 18 0 

I I I I I I I I I I I I II I I I I I I III I II I I I I I I I II I I I I I I I I I I I I I I 
Db 983 GAGAAGGC - AAAACT CT ACT C AT AGAT AGAGAT T TAAC AGAGT T T T CAGAAT T AGAAT AC 1041 

Qy 181 T C AGAAAT G GAAT CAT CAT T CAGT G G CT C T CAAAAGGC AGAAC C T G C CGT AAC AGT AGC G 24 0 

I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I II I I I I I I I 
Db 1042 TC AGAAAT GGGAT CAT C GTT CAGT GT CT CT C CAAAAG CAGAAT CT GC CGT AATAGTAGCA 1101 

Qy 241 AAT C CT AGGGAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAGGACT T AGT T AGT C T T 300 

I I I II I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I 
Db 1102 AAT C C TAG G GAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AAT 1161 

Qy 301 AACAT CCTT CAT ACT CAGCAGGAGTT AT CTACAGT CCTTACGAAAT CAGTT GAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I II I I I I I I Mill I I II III I I I I II 

Db 1162 AACAT CCTT CAT AAT CAACAAGAGT T AC CT AC AG CT CTT AC TAAAT T GGT T AAAGAG GAT 1221 

Qy 361 GAT AGAGT T CT GT CT C C AGAAAAAAC AAAGGAC AGT T TT AAG GAAAAGG GAGTT G CAGCA 420 

II I I I I II II I I I II II II MM I II I I II I I II I II I I I II I I I I II I 

Db 1222 GA AGTT GT GT CTT CAGAAAAAGCAAAAGACAGTTTTAAT GAAAAGAGAGTTGCAGT G 1278 

Qy 421 GAAGC T T C TAT G GG GGAG GAAT AT GC AGAC T T CAAAC CAT T T GAGC GAGT AT GGGAAGT G 480 

I I M I I I II II II II II II I I I II II I II I I I II II I II I I I I M I II M I I II II I I 

Db 1279 GAAG C T C CT AT GAGGGAGGAAT AT GC AGACT T CAAAC CAT T T GAG C GAGT AT GGGAAGT G 133 8 

Qy 481 AAAGAT ACT T AC AAGCAAGAT AGT GAT GT T T T GAT T GCT G GAGGT AAT AT AGAGAGCAAA 540 

II I II I I I III M II II I I II I I III II II I I I II II I II I II I II I I 

Db 1339 AAAGATAGT AAGGAAGATAGT GAT AT GTT GGCT GCT GGAGGTAAAAT C GAGAGCAAC 1395 

Qy 541 T T GGAAGGT AAAGT G GATAAGAAAC ACT T T T C AGAT AGC CT T GAAC AAAC AAAT C GT GAA 600 

II II I I II I I I II II I I II III III I I I II II I II II I I I II I I I I I III 
Db 1396 T T GGAAAGT AAAGT G GAT AAAAAAT GT TT T G C AGAT AGC CT T GAGC AAACT AAT C AC GAA 1455 

Qy 601 AAAGATAGT GAAAG CAGT AAT GAT GACACTT CATTTCCCAGTACACCAGAAGCTGTAAGA 660 

II II II I II II II II II II II I II I I I I I M I I I I I II I I M I I I I I Ml 
Db 1456 AAAGATAGT GAGAGT AGT AAT GAT GAT AC TTCTTTCCC CAGT AC G C CAGAAG GT ATAAAG 1515 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I I I I I I II I I II I II I I II I I I I I I II II II I I II I II I I I M I II 
Db 1516 GAT C GT T C AG GAGC AT AT AT C ACAT GTGCTCCCTT TAAC C CAG CAG CAACT GAGAGC AT T 1575 

Qy 718 T C AAC AAAC AT TTTTCCCTTGTTG GAAG AT CAT AC T T C G G AAAAT AAG AC AG AT GAAAAA 777 

I II I II I I II I I I I I I II II I I II I II I II I II I I I I II II I II II I I I M II 
Db 1576 GC AACAAAC AT T TT T C CT T T GT T AG GAG AT C CT AC T T C AGAAAAT AAGAC C GAT GAAAAA 1635 



Qy 778 AAGATAGAA-AAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATCA 836 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 
Db 1636 AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 1695 

Qy 837 AAC CCTTTCCT TAT G GC AG CAC AG GAGT CT AAGACAGAT T AC GT T ACAAC AGAT CAT GT G 896 

I I I I I I I I III I I I I I I I I I II I III I I I I I I I I II II I I I I I I I I I II I 
Db 1696 AAC CCTTTTCTT GT AG C AG CAC AG GAT T C T GAGACAGAT TAT GT CACAAC AGATAAT T T A 1755 

Qy 897 T CAAAGGT GAC C GAGGAAGT AGT GG CAAAC AT G C CT GAAG GT C T AAC C C C AGAT T T GGT T 956 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
Db 1756 ACAAAGGT GAC T GAG GAAGT C GT G GCAAAC AT G C CT GAAGGC CT GACT C C AGAT T T AGT A 1815 

Qy 957 CAGGAAGCAT GT GAAAGT GAATT GAAT GAAGCTACT GGTACAAAAATT GC CTTT GAAACA 1016 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1816 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGTTACT GGTACAAAGATT GCTTAT GAAACA 1875 

Qy 1017 AAAAT GGAC CT G GT T CAAACT T C AGAAG CT GT G C AGGAGT CAC T T T AC C C T GT AACACAG 107 6 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1876 AAAAT GGACT T GGT T CAAAC AT C AGAAGT T AT GCAAGAGT CACT C TAT CCT GC AGC ACAG 1935 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1936 C T TT GC C CAT CAT T T GAAGAGT CAGAAG CT ACT C CT T CAC CAGT T TT G CCT GAC AT T GT T 19 95 

Qy 1137 AT GGAAGCACCATTAAAT T CT GT AGTT C CT AGT GCT GGT GCTT CT GCAGT GC AGCT CAGT 1196 

I I I I I I I I I II II I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I II I 
Db 1996 AT GGAAG CAC CAT T GAAT T CT G C AGTT C CT AGT GCTGGTGCTTCCGT GAT AC AGC C C AGC 2 055 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 2056 T CAT CAC CAT T AGAAG CTT CTT CAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2112 

Qy 1257 AAT C C C C CAC CAT AT GAGGAGGCC AT GAAT GTAT CACTAAAAAAAGAAT C AGGAAT GAAT 1316 

II I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I MINN! II 

Db 2113 AAC C C C C CAC CAT AT GAAGAGG CC AT GAGT GTAT C ACTAAAAAAAGT AT C AGGAAT AAAG 2172 

Qy 1317 GAAGAAAT CAC AGAGC CT GAAG GT ATT AGT GT AG C T GT T C AGGAAAC AGAAGC T C C T TAT 1376 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 2173 GAAGAAAT TAAAGAGCCT GAAAAT ATTAAT GCAGCT CTT CAAGAAACAGAAGCT CCTTAT 2232 

Qy 1377 AT AT CT AT T G CAT GT GAT T T AAT TAAAGAAACAAAGAT C T CT ACT GAACC GACT C C AGAT 1436 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I III 
Db 2233 AT AT CT AT T GCAT GT GAT T T AATT AAAGAAACAAAG CT T T CT GCT GAACC AGCT C C G GAT 2292 

Qy 1437 T T CT CT AGT TAT T CAGAAAT AG C AGAAGT T G CAC AGC CAGT GC C C GAG CAT T C T G AGCT A 14 96 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I 
Db 2293 T T CT CT GAT TAT T CAGAAAT G G CAAAAGTT GAAC AGC CAGT GC C T GAT CAT T C T GAGC T A 2 352 

Qy 1497 GT T GAAGAT TCCTCCCCC GATT CT GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC C C 1556 

I I I I I I I II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 353 GT T GAAGAT T C C T CAC CT GAT T CT GAAC C AGT T GACT TAT T T AGT GAT GAT T CAAT AC C T 2 412 

Qy 1557 GAAGTT C CACAAAAACAAGAT GAAGCT GTAATACTT GT GAAAGAAAAC CT CACT GAAAT T 1616 

II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 2413 GAC GT T C CACAAAAACAAGAT GAAACT GT GAT GCT T GT GAAAGAAAGT C T CAC T GAGAC T 2472 



Qy 1617 T CAT CT GAGT CAAT GACAG GAC AT G ACAAT AAGG GAAAACT C AGT GCT T C AC CAT CAC CT 1676 

I I I I I I I I I I I I I I I II I I I I I I II I I II M I I I I I I I I I I I I I I I II I 
Db 2473 T CAT T T GAGT CAAT GAT AGAAT AT GAAAATAAG GAAAAACT CAGT GC T T T GC CACCT 252 9 

Qy 1677 GAGG GAGGAAAAC C GT AT T T G GAGT C T T T T C AGC C CAGT T T AGG C AT C AC AAAAGAT AC C 173 6 

I I I I I I I I I I I II I I I I I I I I I I I I II III I I I II I I I I I I I I I I I I II I I I 
Db 2530 GAG GGAGGAAAGC CAT AT T T G GAAT CT T T T AAG CT CAGT T T AGAT AAC ACAAAAGAT AC C 2589 

Qy 1737 T TAG C AC CT GAT GAAGT T T CAG CAT T GAC C CAAAAG GAGAAAAT C C CT T T G C AGAT GGAG 1796 

I I I I I I I I I I I I II I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2590 C T GT T ACCT GAT GAAGT T T C AAC AT T GAG CAAAAAGGAGAAAAT T C CT T T GC AGAT G GAG 264 9 

Qy 1797 GAGCT CAAT AC T GCAGT T TAT T CAAGT GAT G GCTT AT T CAT T GC T CAGGAAGCAAAC CT A 1856 

I I I I I II I II I I II I I I I I I I I I I I I I I I MM; III II I I I I I I I I I II 

Db 2650 GAGCT CAGT ACT GCAGTTTATT CAAAT GAT GACTTATTT ATTT CTAAGGAAGCACAGATA 2709 

Qy 1857 AGAGAAAGT GAAAC AT T T T CAG AT T CAT C T C C GAT T GAG AT TAT AG AT GAGT T C C C GAC C 1916 

II I II I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II 

Db 2710 AGAGAAAC T GAAAC GT T T T C AGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C CT ACA 2769 

Qy 1917 T T T GT CAGTT CT AAAGC AGAT T CT T C T C CT AC ATT AGCC AG GGAAT AC ACT GAC C TAGAA 197 6 

II I II I I I I I I I I I I I I I I I I III II I II I I I I I I I I I I I I I II I I I I I I I 

Db 2770 T T GAT CAGTT CT AAAAC T GAT T CAT T T T C T AAATT AG C C AGG GAAT AT ACT GAC CTAGAA 2 829 

Qy 1977 GT AGC C CACAAAAGT GAAAT T GC T GAC AT C C AGGAT G GAGC T G GGT CAT TGGCTTGTG C A 203 6 

III I I I I I I I I I I I I II I I I I I I I II I I I I I I I II I I I I I I I I I I II I I II 

Db 2830 GT AT C C CACAAAAGT GAAAT T GC T AAT GC C C C G GAT GGAG CT GGGT CAT T G C C T T G CAC A 28 89 

Qy 2037 GGAT T G C C C CAT GAC CT T T CT TT CAAGAGT AT ACAAC CT A AAGAGGAAGTTCAT 2090 

I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 2 8 90 GAAT T G C C CC AT GAC CTTTCTTT GAAGAAC AT ACAAC CCAAAGT T GAAGAGAAAAT CAGT 294 9 

Qy 2091 GT C C C AGAT GAGT T CT C CAAAGAT AG G GGT GAT GT T T CAAAGGT GC C C GT ACT G C C T C C A 2150 

I I I I I I I I I I I I I I I I I I II III I I I I I I II I I I I I I I I II II I 
Db 2950 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 3009 

Qy 2151 GAT GT T T CT G CT TT GGAT GCT CAAGC AGAG AT AGGCAGC AT AGAAAAAC C CAAAGT T C T T 2210 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3010 GAT GTTTCTGCTTTGGC C ACT CAAGC AGAGATAGAGAGC AT AGT TAAAC C CAAAGTT CT T 3069 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACT T CCT T CT GATACAGAAAAAGAGCGAAGAT CT CCAT CT 227 0 

I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I II I I I I I I I I I II I I I I II I I 
Db 3070 GT GAAAGAAGCT GAGAAAAAACT T CCTT C CGATACAGAAAAAGAGGAC AGAT CAC CAT CT 3129 

Qy 2271 G CT AT AT T T T C AGCAGAGCT GAGT AAAAC T T CAGT T GT T GAC C T C CT C T AC T G GAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3130 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3189 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I II II I I I I I I 
Db 3190 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 324 9 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3250 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3309 

Qy 2451 T T T AGGAT AT AT AAG G GT GT GAT C CAG G CT AT C CAGAAAT C T GAT GAAGG C CAC C CAT T C 2510 



Db 3310 T TT AGGATATACAAGGGT GT GAT C CAAGCT AT C CAGAAAT CAGAT GAAGGC CACC CAT T C 3369 

Qy 2 511 AG GG C ATATT T GGAAT C T GAAGT T G C TAT AT CT GAG GAGT T GGT T C AGAAGT AC AGC AAT 257 0 

I I I I I I I I I ! I I I I I I I i I II I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I III 
Db 3370 AG G G CAT AT CT GGAAT CT GAAGT T GC T AT AT C T GAG GAGT T G GT T C AGAAGT AC AGT AAT 3429 

Qy 2571 TCTGCTCTT G GT CAT GT T AAC T GCACAAT AAAAGAACT C AGAC GCCTCTTCT T AGT T GAT 2630 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34 30 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 34 89 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34 90 GAT T T AGT T GAT T CT C T GAAGT T T GCAGT GT T GAT GT GGGT AT T T AC CT AT GTTGGTGCC 354 9 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 3550 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3609 

Qy 2751 AT TT AT GAAC G GC AT C AG GC GCAAAT AGAT CAT TAT CT GGGACT T GCAAATAAGAAT GT T 2810 

I I I I I I I I I I I I I I I I I II I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3610 AT TT AT GAAC G GCAT C AGG C ACAGAT AGAT CAT TAT C T AGGACTT GCAAATAAGAAT GT T 3669 

Qy 2 811 AAAGAT G C TAT G GCT AAAAT C CAAG CAAAAAT C C CT GGAT T GAAGC GT AAAGC T GAAT GA 2 870 

I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3670 AAAGAT GCT AT G GC T AAAAT C CAAGCAAAAAT C C C T GGAT T GAAGC GCAAAG CT GAAT GA 3729 

Qy 2 871 GAAAGCCT GAAAGAGTTAACAAT AGAGGAGTTT AT CTTTAAAGGGGAT ATT CATTT GATT 2 930 

I I I II II I I I I I I I I I I I I II I I I I I II I I I I I II I I I II I I I I I I 

Db 3730 AAAC G C C CAAAAT AAT T A GTAGGAGTT CAT CTTTAAAGGGGAT ATT CATTT GATT 37 84 

Qy 2 931 C CAT T GG G G AGG GT CAG G GAAGAACAAAGC C T T GACAT T GCAGT GC AGT T T C AC 2984 

I I I II I I I I I I I I I I I I I I II II I I I I I I I I M I I I I I I I I I I I I II 
Db 3785 AT AC GGG GGAGG GT CAG G GAAGAAC GAA- C CT T GAC GT T GCAGT GCAGT T T C ACAGAT C G 3843 

Qy 2985 AGAT CT T TAT T T T T AGCAAC GCAGT G- T CT GAG GAAAAAT GAC C T GT C TT GACT G 3038 

I I I I I I I I I I I I II I I I I III II I I I I I I I I I M I I I I I I I I I I I I II I 
Db 384 4 T T GT T AGAT CT T TAT T T T TAG C CAT GC ACT GT T GT GAG GAAAAAT T AC CT GT CT T GACT G 3903 

Qy 303 9 C C CT GT GT T CAT CAT C T TAAGT AT T GTAAG CT G C TAT GT AT GGAT T TAAAT C GTAAT CAT 3 098 

II I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3904 C CAT GT GT T CAT CAT CT TAAGT AT T GT AAGCT G CTAT GT AT G GAT T T AAAC C GTAAT CAT 3963 

Qy 309 9 AT TT GTTTTTCCT GT AT GAGGC AC T GGT GAAT AAAC AAAGAT CT GAGAAAG CT GT AT AT T 3158 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3964 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC C T GT AT AT T 4 010 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCTGAC 3217 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 4 011 T T AC T T T GT T GC AGAT AGT C T T GC C GC AT C T T G G C AAGT T G C AGAGAT G GT GGAGC TAG A 4 070 

Qy 3218 AGAAATAA 3225 

I III II 
Db 4071 AAAAAAAA 4078 
ID AAZ56886 standard; DNA; 3579 BP. 
XX 

AC AAZ56886; 
XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide encoding DNA. 
XX 

KW MAGI protein; neuroendocrine-specific protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective; neuroprotective; ds. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..3579 

FT /*tag= a 

FT /product= "MAGI polypeptide" 
XX 

PN WO200005364-A1. 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB002360. 
XX 

PR 22-JUL-1998; 98GB-00016024. 

PR 19-JUL-1999; 99GB-00016898. 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR P-PSDB; AAY56967. 
XX 

PT Novel polypeptides related to neuroendocrine-specific proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders. 
XX 

PS Claim 5; Page 19-20; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specific protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleotide is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents a DNA encoding the human MAGI protein 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 U; 0 Other; 



Query Match 58.0%; Score 2223.6; DB 3; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 



Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GT AC T G C C C AC T GAAG GAAC AC T T C C AG C AAC T T C AAAT GAAG C T T C T AAAG CAT T C T C A 120 

III I I I I I I I I I M I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I 

Db 772 GT AT T AC C C AC T GAAG GAAC AC T T C AAGAAAAT GT C AGT GAAG CT T C T AAAG AG GT C T C A 831 

Qy 121 GAGAAGGCAAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T T T CAGAAT T G GAAT AT 180 

I I I I I I I I I I I I I II I I I I I I III I I I I I I I I I I I I I II I I I I I I II I I I 
Db 832 GAGAAGG C AAAAAC T C TACT C AT AGAT AGAGAT T T AAC AGAGT T T T CAGAAT T AGAAT AC 8 91 

Qy 181 TCAGAAAT GGAAT CAT CAT T C AGT GGCT CT CAAAAGGCAGAACCT GCCGTAACAGTAGCG 240 

I I I I I I I I I I I I I I I I I I I I I I I Mill III II I I I I I II t I I I II I I I I II 
Db 892 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 241 AAT C C TAG G GAC GAAAT AGT T GT GAG GAGT AG AGAT AAAG AAGAG G ACT T AGT T AGT C T T 300 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I MINIMI I 
Db 952 AAT CCTAGGGAAGAAATAAT CGTGAAAAATAAAGAT GAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 301 AACAT CCTT CATACT CAGCAGGAGTTATCTAC AGT CCTTAC GAAAT CAGTT GAAGAAGAA 360 

I I II I II I I I I II III II I I II I I I I I I I I I I I I I I I I I III MM I I 

Db 1012 AACAT CCTT CAT AAT CAACAAGAGT T AC CT AC AG C T CTT ACT AAAT T GGT T AAAGAG GAT 1071 

Qy 361 GAT AGAGT T CT GT CT C C AGAAAAAAC AAAG GAC AGT T TT AAG GAAAAGGGAGTT GCAGC A 420 

II MM I I I I I I I II I I I I MM I I II I I I I I I I II I I I I I I I II II I I 

Db 1072 GA AGT T GT GT CTT CAGAAAAAGCAAAAGAC AGTTTTAAT GAAAAGAGAGTT GCAGT G 112 8 

Qy 421 GAAGCT T C TAT GGG GGAG GAAT AT GCAGAC T T C AAAC CAT T T GAGC GAGT AT GG GAAGT G 480 

I I II I I II I I I I I II I I II I I I I II M I I I 1 I I I I I I 1 I I M II I I I II I I M I I I II 
Db 112 9 GAAGCT C C TAT GAG GGAG GAAT AT G CAGAC T T C AAAC CAT T T GAGC GAGT AT GGGAAGT G 118 8 

Qy 481 AAAGAT ACTT ACAAGCAAGAT AGT GAT GTTTT GATT GCT GGAGGTAAT ATAGAGAGCAAA 540 

I II I II I I III I I I I I I I I I II I III I I I I I I I M I M M I I I II II I 
Db 118 9 AAAGATAGT AAG GAAGAT AGT GAT AT GTTGGCTGCTG GAGGT AAAAT C GAGAGC AAC 1245 

Qy 541 T T GGAAGGT AAAGT G GAT AAGAAACAC TT T T C AGAT AGC CTT GAACAAAC AAAT C GT GAA 600 

MINI I I M I II M I II I III Ml I II I I I I I II I II I II I I I I I I III 
Db 124 6 TT GGAAAGT AAAGT G GAT AAAAAAT GT TT T GC AGAT AGC CTT GAGCAAAC TAAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGCAGTAATGAT GAC ACTT CAT T T CCCAGT ACAC CAGAAGCT GTAAGA 660 

I II I I II I I I I II I I I II I M I M I I I I I M I I I M I I I II I I II I I ML 
Db 1306 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT T C T T T C C CC AGT AC GC CAGAAG GT AT AAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACT GAGAAT GT T 717 

I I II I II II I II I I I II II I I I M II II I I I I I II I I I M I II I II 

Db 1366 GAT C GT C C AGGAGCAT AT AT C ACAT GTGCTCCCTT T AAC C CAGCAGCAAC T GAGAG CAT T 1425 

Qy 718 T C AAC AAAC AT TTTTCCCTTGTTG GAAGAT CAT AC T T C G GAAAAT AAGAC AG AT G AAAAA 777 

II I I I II I I I II II II Mill I Mill I I I II I II I M I I I I I I I I I I I I I M 
Db 1426 GCAACAAACAT T T TT C CTTT GTT AGGAGAT C CT ACT T CAGAAAAT AAGAC C GAT GAAAAA 14 85 

Qy 77 8 AAGAT AGAA- AAAAAAAGGC ACAAAT T GT AAC AG AGAAGAAT GCAAGT GT C AAGACAT C A 836 

II II M II I II I I II I I I I I I I I II I I I I I I I I I I M I M III II M M 

Db 14 8 6 AAAAT AGAAGAAAAGAAGG C C CAAAT AGT AAC AG AGAAGAAT ACT AG CAC CAAAAC AT C A 1545 



Qy 837 AACC CT TT CCTTAT GGC AGCACAGGAGT CTAAGACAGATTAC GTTACAACAGAT CAT GT G 896 

I I I I I I I I III I II I I I I I I I I I III I II I I I I I I I II I I I I I I I I I II I 
Db 1546 AAC CCTTTTCTT GT AG C AG C AC AGGAT T C T GAGAC AGAT TAT GT CACAAC AGAT AAT T T A 1605 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I I I I I I I I II I I I I I I I I I I I II I I I I I II II I I I I II II I I I I I I I I II 
Db 1606 ACAAAG GT G AC T GAGGAAGT C GT G G CAAAC AT GC CT GAAG G C C T GACT C C AGAT T T AGT A 1665 

Qy 957 C AG GAAG CAT GT GAAAGT GAAT T GAAT GAAGC T ACT G GT ACAAAAAT T G C C TT T GAAAC A 1016 

I I I I I II II I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1666 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGT TACT GGTACAAAGATT GCTTAT GAAACA 1725 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 107 6 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I 
Db 1726 AAAATG GACT T GGTT CAAAC AT CAGAAGTT AT GCAAGAGT CACTCTAT C CT GCAGCACAG 17 85 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 178 6 CT T T GC C CAT CAT T T GAAGAGT C AGAAGC T AC T C CT T C AC C AGT T T T GC C T G AC AT T GT T 18 4 5 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I II I I II I I III 
Db 1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 T CAT C AC CAT T AGAAACT CT T C CT T CAGT T AAT TAT GAAAG C ATAAAGT T T GAGC CT GAA 1256 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 1906 T CAT C AC CAT T AGAAG C T T CT T CAGT T AAT TAT GAAAGC ATAAAAC AT GAGC C TGAA 1962 

Qy 1257 AAT C CCC CAC CAT AT GAGGAGGCCAT GAAT GT AT CACTAAAAAAAGAAT CAGGAAT GAAT 1316 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I II 

Db 1963 AAC C CC C CAC CAT AT GAAGAGGC CAT GAGT GT AT CAC T AAAAAAAGT AT CAGGAAT AAAG 2022 

Qy 1317 GAAGAAAT CACAGAGCCTGAAGGTATTAGT GTAGCTGTT CAGGAAAC AGAAG CT C CTT AT 1376 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2023 GAAGAAAT T AAAGAGC CT GAAAAT AT TAAT GCAG CT CT T CAAGAAACAGAAG CT C CTT AT 2082 

Qy 1377 ATAT CTATTGCATGTGATTTAATTAAAGAAACAAAGATCT CTACTGAACCGACTCCAGAT 1436 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I III 
Db 2083 ATAT CT AT T G CAT GT GATT TAAT T AAAGAAACAAAGCT T T CT G CT GAAC C AGC T C C GGAT 2142 

Qy 1437 T T C T CT AGT TAT T C AGAAAT AG C AG AAGT T G CAC AG C CAGT G C C C GAG CAT T C T GAG C T A 1496 

I I I I I I I I I I I II I I I I I III I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I 
Db 2143 T T C T CT GAT TAT T C AGAAAT GGCAAAAGT T GAAC AGC CAGT GC C T GAT CAT T C T GAGC T A 22 02 

Qy 14 97 GT T GAAGAT T C CT C C C C CGAT T CT GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC C C 1556 

I I I I I I II I I I I I I II I I I I I I I I I I I II I I I I I I I I I I II I I I I I II I I I I I I I I I 

Db 2203 GT T GAAGAT T C CT CAC C T GAT T C T GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC C T 2262 

Qy 1557 GAAGTT CCACAAAAACAAGAT GAAGCT GTAATACTTGT GAAAGAAAACCT CACT GAAATT 1616 

II I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I 

Db 2263 GACGTT CCACAAAAACAAGAT GAAACT GT GAT GCTTGT GAAAGAAAGT CT CACT GAGAC T 2322 

Qy 1617 T CAT CT GAGT CAAT GAC AGGACAT GAC AAT AAG G GAAAAC T CAGT G CT T CAC CAT CAC C T 1676 

I I I I I I I I I II I I I I II I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 2323 T C AT TT GAGT CAAT GAT AGAAT AT GAAAAT AAG GAAAAAC T CAGT G CT T T G C CACCT 2379 



Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

I I I M I I I I I I II I I I I I I I I I I I I I I III I I I I I I I I I 1 I 1 I I I I I I I II I 
Db 2380 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2439 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I 

Db 2440 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2499 

Qy 1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I III II I I II I I I I I II 
Db 2500 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2559 

Qy 1857 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2560 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II II I I I I I I I I I I I I I I I I I III I I I I I I I II I I I I II II I I I I I I I I I I 

Db 2620 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III II I I II I M I I I I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I II 
Db 2680 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2739 

Qy 2037 GGAT T GC C C CAT GAC CTTTCTTT CAAGAGT AT ACAAC C T AAAGAGGAAGT T CAT 2090 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 2740 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I I I I I II II III II II II I I I I I I I I I I I I II I I I I I I I I 

Db 2800 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I 
Db 2860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I MINI 
Db 2920 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2979 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I II I I I I I II I I I I I 
Db 2980 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II II I II I I I I I I I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

I I I I I I I I I I I II I I I II I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I 
Db 3160 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 



Db 3220 AGG G CAT AT CT G GAAT CT GAAGT T G C TAT AT CT GAGG AGT T G GTT C AGAAGT AC AGT AAT 3279 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I II I I I III I I I I I I I I II I Mill I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 32 8 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I M M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3340 GAT T T AGT T GAT T CT CT GAAGT T T GCAGT GT T GAT GT G GGT AT T T AC C TAT GTT G GT G C C 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I 
Db 34 00 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 34 59 

Qy 2751 AT T TAT GAAC G G CAT CAGGC G CAAAT AGAT CAT TAT CT G GGAC T T GCAAAT AAGAAT GT T 2 810 

M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I 
Db 34 60 AT TT AT GAAC G GC AT C AGG C GC AGAT AGAT CAT TAT CT AG GACT T G CAAAT AAGAAT GT T 3519 

Qy 2811 AAAGAT GC T AT G GC TAAAAT C CAAG CAAAAAT C C C T GG AT T GAAG CGTAAAGC T GAAT GA 2 870 

I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 352 0 AAAGAT GCT AT GGCTAAAAT CCAAGCAAAAAT C C CT GGATT GAAGCGCAAAGCT GAAT GA 3579 



RESULT 9 
AAF90324 

ID AAF90324 standard; cDNA; 3579 BP. 
XX 



AC AAF90324; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A cDNA. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; brain injury; 

KW stroke; neuronal degeneration; Alzheimer's disease; Parkinson's disease; 

KW neuromuscular disorder; psychiatric disorder; developmental disorder; 

KW neuroprotective; nootropic; neuroleptic; antiparkinsonian; 

KW cerebroprotective; neuroleptic; diagnosis; therapy; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2000WO-GB004345. 
XX 

PR 15-NOV-1999; 99GB-00026995. 

PR 24-JAN-2000; 2000GB-00001550. 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR P-PSDB; AAB82349. 



XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases. 
XX 

PS Disclosure; Page 25-26; 25pp; English. 
XX 

CC The present sequence is that of cDNA encoding human NOGO-A (see 

CC AAB82349). NOGO-A is a previously known splice variant of the human NOGO 

CC gene on chromosome 2p21. NOGO-A cDNA was obtained by PCR amplification of 

CC human spinal cord cDNA. The invention relates to a novel splice variant, 

CC NOGO-C (see AAF90323). It provides NOGO-C polypeptides and 

CC polynucleotides, and methods for producing such polypeptides by 

CC recombinant techniques. Also disclosed are methods for utilising NOGO-C 

CC polypeptides and polynucleotides in the treatment of diseases including 

CC neuropathies, spinal injury, brain injury, stroke, neuronal degeneration, 

CC for example Alzheimer's disease and Parkinson's disease, neuromuscular 

CC disorders, psychiatric disorders and developmental disorders. Also 

CC provided are methods for identifying agonists and antagonists for use in 

CC treating conditions associated with NOGO-C imbalance, and diagnostic 

CC assays for detecting diseases associated with inappropriate NOGO-C 

CC activity or levels 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 U; 0 Other; 

Query Match 58.0%; Score 2223.6; DB 4; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
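
Note on the figures above: the "Best Local Similarity" value appears to be the share of aligned columns that are exact matches. The sketch below checks that reading against the reported counts. The formula and the function name `local_similarity` are assumptions made for illustration; the search program's own definition is not given in this listing.

```python
def local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are exact matches.

    Assumed definition: every aligned column is a match, a mismatch,
    or an indel position, so the three counts sum to the alignment length.
    """
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts reported for this result.
pct = local_similarity(matches=2519, mismatches=339, indels=22)
print(f"{pct:.1f}%")
```

Under this assumption the reported counts give 2519 of 2880 aligned columns, which rounds to the 87.5% shown in the listing.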

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I MM II II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

III I I II M I II M I I I I II II I II II I I I I I I I I I I II I I I I Mill 

Db 772 GTATT AC CCACT GAAGGAACACTT CAAGAAAAT GT CAGT GAAGCTTCTAAAGAGGT CTCA 831 

Qy 121 GAGAAGGCAAAAAAT C CATTT GT AGAGAGAAATTTAACAGAATTTTCAGAATT GGAATAT 180 

I I I I I I II I I I I I II I I II I I III I II I I I I I II I II I I I M II I II I I I 
Db 832 G AGAAG G C AAAAAC T C T AC T CAT AG AT AG AG AT T T AAC AG AGT T T T C AGAAT T AG AAT AC 891 

Qy 181 T CAGAAAT GGAAT CAT CAT TCAGT GGCT CT CAAAAGGCAGAACCT GC C GTAACAGTAGCG 24 0 

I I II I I I II I I I I I I I I I I I II I I II II III I II I I I I II I I I M I I II I I I 
Db 8 92 T CAGAAAT G G GAT CAT C GT T CAGT GT C T C T C C AAAAGC AG AAT C T GC C GT AAT AGT AG C A 951 

Qy 241 AAT CCTAGGGACGAAATAGTT GTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 

I I I I I I II I I I II I I I I I I I II I II II I I I I I I I I II I I II I I I I I I I 
Db 952 AAT C C T AG GGAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGTT AGT T AGT AAT 1011 

Qy 301 AAC AT C C T T CAT ACT CAG C AG GAGT TAT C T AC AGT C CT T AC GAAAT CAGT T GAAGAAGAA 360 

I II I I I I I I II I I III II I II I I I II II I I Mill II I I III II II II 

Db 1012 AACAT C C T T CAT AAT CAAC AAGAGT T AC C T ACAG C T CT T AC TAAATT GGT T AAAGAGGAT 1071 

Qy 361 GAT AG AGT T CT GT C T C C AGAAAAAACAAAG GAC AGT T T T AAG GAAAAG GG AGT T G CAG C A 420 

II I II I I I I I I I II II I II MM II II I I I I II I II II II I II I I I I II 

Db 1072 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1128 



Qy 421 GAAGCTT CT AT GGGGGAGGAATAT GCAGACT T CAAAC CAT T T GAGC GAGTAT GGGAAGT G 4 80 

MINI I I I I I I I I I I I I I I II I I ! I I I I I I I I I I I I I I I | | | | M II I II I I I I I I I 
Db 1129 GAAGC T C C TAT GAG GGAGGAAT AT GCAGACT T CAAAC CAT T T GAGC GAGTAT G G GAAGT G 118 8 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540 

I I M I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I 
Db 1189 AAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCT GGAGGT AAAAT C GAGAG CAAC 1245 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

I I I I I I I I I I I I I I I I I I I III III I I I I I I I I I I I | | | | I I I I I I I II I 
Db 1246 T T GGAAAGT AAAGT GGAT AAAAAAT GT T T T GC AGAT AG C C T T GAG CAAAC TAAT CAC GAA 1305 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GAC ACT T CAT T T C C C AGTAC AC C AGAAG CT GTAAGA 660 

I I M I M I M I II I I I I I I II I I I I I I I I M I I I I I I | | | | | | | || | Ml 
Db 1306 AAAGATAGT GAGAGT AGTAAT GAT GAT ACT TCTTTCCC C AGT AC GC C AGAAG GT AT AAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC---CAACAACTGAGAATGTT 717 

I I M I I I II I II I I I I I I I I I I I I II I I | | | | | || I | | | | | | | | M 

Db 1366 GAT C GT C CAG GAG CAT AT AT CAC AT GT GCT C C CT T T AAC C CAG CAG CAAC T GAGAG CAT T 1425 

Qy 718 T CAACAAACAT T TTT CC CTT GTT GGAAGATCATACTT CGGAAAATAAGACAGAT GAAAAA 777 

I I I I I I M I I I M I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I II II II I 
Db 142 6 GCAACAAAC AT TTTTCCTTTGT TAG GAGAT C C T ACT T C AGAAAAT AAGAC C GAT GAAAAA 14 85 

Qy 77 8 AAGATAGAA- AAAAAAAGGCACAAATT GT AAC AGAGAAGAAT GCAAGT GT CAAGACAT CA 836 

II I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I II I II III MINI 

Db 14 8 6 AAAAT AGAAGAAAAGAAGGC C CAAAT AG TAAC AGAGAAGAAT ACT AGC AC C AAAAC AT C A 1545 

Qy 837 AAC C C T T T C CT T AT GGCAG CAC AGGAGT C T AAGACAGATT AC GT TACAAC AGAT CAT GT G 8 96 

M I I I I I I Ml | | M I I I I I I I I III I I I I I I I I I I || I I I II I I I I II I 
Db 154 6 AAC CCTTTTCTT GT AGC AG CAC AGGAT T CT GAGAC AGATT AT GT CACAAC AGAT AAT T T A 1605 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I II I II II II I I I I I I II 
Db 1606 ACAAAG GT GAC T GAGGAAGT C GT G GCAAAC AT GC CT GAAGG C CT GAC T C C AGAT T T AGT A 1665 

Qy 957 CAG GAAGC AT GT GAAAGT GAAT T GAAT GAAGC TACT GGT AC AAAAAT T GC C T T T GAAAC A 1016 

I I I I I I I I I I M I I I I I I I I I I I I I I I I || I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 1666 CAG GAAG CAT GT GAAAGT GAAT T GAAT GAAGT TAC T GGT ACAAAGAT T G C T TAT GAAAC A 1725 

Qy 1017 AAAATG GAC C T GGT T CAAACT T C AGAAG C T GT G C AGGAGT CAC T T TAC C CT GT AAC AC AG 1076 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I 
Db 172 6 AAAAT GGAC TT G GT T CAAACAT C AGAAGT TAT GCAAGAGT CACT C TAT C C T G C AGCAC AG 17 8 5 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I M I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I M I I I I I I I I I I I I I I I I I I | M I I I I I I I I I I I M I I I I I I I I I Ml 
Db 184 6 AT GGAAG CAC CAT T GAAT T C T G C AGT T CCT AGT G CTGGT GCT T C C GT GAT ACAGC C CAG C 1905 

Qy 1197 T CAT CAC CAT T AGAAACT C T T C C T T C AGT TAAT TAT GAAAGC AT AAAGT T T GAG CCT GAA 1256 

I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I II I 
Db 1906 T CAT CAC CAT T AGAAG C T T C T T C AGT TAAT TAT GAAAG CAT AAAAC AT GAG CCT GAA 1962 

Qy 1257 AAT C C C C CAC CAT AT GAG GAG G C CAT GAAT GT AT C ACT AAAAAAAGAAT CAG GAAT GAAT 1316 



Db 1963 AAC C C C C C AC CAT AT GAAGAGGC CAT GAGT GT AT CAC T AAAAAAAGT AT C AG GAAT AAAG 2022 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I II I II I I I I I I I I I I I | I I I 
Db 2 023 GAAGAAAT T AAAG AGC C T GAAAAT AT T AAT G C AG C T C T T C AAGAAAC AG AAG C T C C T TAT 20 82 

Qy 1377 AT AT C TAT T GC AT GT GAT T TAAT T AAAGAAACAAAGAT C T CT ACT GAAC C GACT C CAGAT 1436 

I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I III 
Db 2083 AT AT C TAT T GC AT GT GAT T TAAT T AAAGAAACAAAGCT T T CT G CT GAAC C AGCT C C G GAT 2142 

Qy 1437 T T CT C T AGT T AT T C AGAAAT AG CAGAAGT T GC ACAGC C AGT G C C C GAGC ATT CT GAGC T A 14 96 

I I M I I I I I I I I I I I I II I M | | | | | I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2143 T T CT C T GAT TAT T C AGAAAT GGCAAAAGT T GAACAGC C AGT G C C T GAT CAT T CT GAGC T A 2202 

Qy 1497 GT T GAAGAT TCCTCCCCC GATT C T GAAC C AGT T GACTT AT T T AGT GAT GAT T CAAT AC C C 1556 

I M I I II I I I I I I I II I I I I I I || I I | | | | | | | | | | | | | | | | | | | | | M | | | | | | M 
Db 2203 GT T GAAGAT T C C T CAC C T GATT CT GAAC C AGT T GACT TAT T T AGT GAT GAT T CAAT AC C T 2262 

Qy 1557 GAAGT T C C ACAAAAACAAGAT GAAG C T GTAAT ACT T GT GAAAGAAAAC C T CAC T GAAAT T 1616 

M I I I M I I I M I I I I I I I I I I I I I II II I I I I I I I II I I I I I I I I I I I I I I 
Db 22 63 GAC GT T C C ACAAAAACAAGAT GAAACT GT GAT G CT T GT GAAAGAAAGT CT C ACT GAGAC T 2322 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCTT CAC CAT C AC CT 167 6 

I I I I I I I I I M I M I II I I I I I I I I I I I I | | || || | | | | | || | | | | | | | 
Db 2323 TCATTTGAGT CAAT GATAGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 2379 

Qy 1677 GAGG GAGGAAAAC C GT AT T T GGAGT C T TT T C AGC C CAGT T T AGG CAT C AC AAAAGAT AC C 1736 

I M I I I I I M I II I I II I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I 
Db 2380 GAG GGAGGAAAGC CAT AT T T GGAAT C T T T TAAGCT CAGT T TAGATAACAC AAAAGAT AC C 2439 

Qy 1737 TT AGCAC CT GAT GAAGT T T C AGC AT T GAC C CAAAAGGAGAAAAT C C CT T T G CAGAT GGAG 1796 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I || I | | | | | | | | | M | 
Db 244 0 CT GT T AC CT GAT GAAGT T T CAACAT T GAG CAAAAAGGAGAAAAT T C CT T T GC AGAT GGAG 24 99 

Qy 17 97 GAG CT CAAT ACT G C AGT T TAT T CAAGT GAT GGCT TAT T CAT T GC T C AGGAAGCAAAC CT A 1856 

I I I I I M I I I I I I I I I I I I I I I I I I II II I I I II I III II I I I I I I I I I II 
Db 2500 GAGCT CAGT AC T GCAGT TT AT T C AAAT GAT GACT TAT T TAT T T CTAAGGAAGC AC AGAT A 2559 

Qy 18 57 AGAGAAAGT GAAAC AT T TT CAGAT T CAT C T C C G ATT GAGAT TAT AGAT GAGT T C C C GAC C 1916 

I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I M I I II 
Db 2560 AGAGAAACT GAAAC GT T T T CAGAT T CAT CT C CAAT T GAAAT T ATAGAT GAGT T C C C T AC A 2 619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 197 6 

M I I I I I I I I I I I I I I I I I I I III I I I I I II I I I II I I I I I I I I I II I I I I 
Db 262 0 T T GAT C AGT T C T AAAACT GAT T CAT T T T C T AAAT T AGC C AG G GAAT AT ACT GAC CT AGAA 2679 

Qy 1977 GT AGC C C ACAAAAGT GAAAT T G C T GAC AT C C AG GAT G GAG CT GG GT CAT TGGCTTGT GC A 2036 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2680 GT AT C C C AC AAAAGT GAAAT T GCTAAT G C C C C G GAT GGAG CT G GGT CAT T G C C T T G CAC A 2739 

Qy 2037 GGAT T GC C C CAT GAC CTTTCTTT CAAGAGT AT ACAAC C T AAAGAGGAAGT T CAT 2 09 0 

I I I I I I I I I I I M I II I II I I I I I I I I M I I I I I I I I I I I I I I I 

Db 2740 GAAT T G C C C CAT GAC CTTTCTTT GAAGAACAT AC AAC C CAAAGT T GAAGAGAAAAT CAGT 2799 

Qy 2 091 GT C C CAGAT GAGTT C T C CAAAGAT AGGG GT GAT GT T T CAAAG GT GC C C GT ACT G C C T C C A 2150 

I I I M I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 



Db 2 8 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2 859 

Qy 2151 GAT GTTTCTGCTTTG GAT GC T C AAGCAGAGAT AGG CAGC AT AGAAAAAC C C AAAGT T CT T 2210 

M I I I II I I I I I I II I I I I I I I I I || | | || | | | | || || I I I M II I I I I M I I 
Db 2860 GAT GTTTCTGCTTTGGC C AC T C AAG C AG AGAT AGAG AGC AT AGT T AAAC C C AAAGT T C T T 2919 

Qy 2211 GT GAAAGAAG CC GAGAGAAAAC TTCCTTCT GAT ACAGAAAAAGAGC GAAGAT C T C C AT CT 2270 

I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I | I I I 
Db 292 0 GT GAAAGAAGCT GAGAAAAAACT T C CT T C C GAT ACAGAAAAAGAG GACAGAT C AC CAT CT 2 979 

Qy 22 71 GC T AT AT T T T CAGCAGAG C T GAGT AAAACT T CAGT T GT T GACCT C C T C TACT G GAGAGAC 2330 

I M I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I II II I I I I I I I I 
Db 2 980 GC T AT AT T T T CAGCAGAG C T GAGTAAAACT T C AGT T GT T GACC T C C T GT ACT G GAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I II I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 309 9 

Qy 23 91 TTCAGCATTGT GAGT GTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTAT CAGC 245 0 

M I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I || M | | | | | | | | | | | || | M | 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGAC CAT CAGC 3159 

Qy 24 51 T T TAG GAT AT AT AAG G GT GT GAT C C AG G C TAT C C AGAAAT C T GAT GAAG G C C AC C CAT T C 2510 

I I I I I I I M M I I I I I I I I I I I I I I I I I II I I II I I I II I I I I I I I I I I I I I I I I I I 
Db 3160 T T T AGGAT AT ACAAGGGT GT GAT C CAAG CT AT C C AGAAAT C AGAT GAAGGC C AC C CAT T C 3219 

Qy 2511 AGGG CAT AT T T G GAAT CT GAAGT T GCT AT AT CT GAGGAGT T GGT T C AGAAGT ACAGC AAT 2570 

I M I I I I I I I I I I I I I II I I I I I I I I I M I I I I | | | || | | | | | | | | | | | M | | | | Ml 
Db 322 0 AGGGCATATCT GGAAT CT GAAGT T GCT AT AT CT GAGGAGT T GGT T C AGAAGT ACAGT AAT 327 9 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GAT T TAGT T GAT T CT C T GAAGT T T G CAGT GTT GAT GT GG GT AT TT AC CT AT GTTGGTGCC 2690 

I I I I I I M I I I I I I I I I I I I I I II II I I I I I I I I I I II I II I I II I I I I I 

Db 334 0 GATTTAGTTGATTCTCT GAAGT TTGCAGT GTT GAT GTGGGTATTTACCTAT GTTGGTGCC 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I M I I I I I I I I I II II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459 

Qy 2751 AT T TAT GAAC GGC AT C AG G C G C AAAT AGAT CAT TAT C T GGGACT T GCAAATAAGAAT GT T 2810 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 3460 AT T TAT GAACGG C AT C AG GC GC AGAT AG AT CAT T ATCT AGGACT T GCAAATAAGAAT GTT 3519 

Qy 2 811 AAAGAT GCT AT GGC TAAAAT C CAAG CAAAAAT C C CT GGAT T GAAG C GTAAAGCT GAAT GA 2870 

I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I I I I 
Db 352 0 AAAGAT G C TAT GGC TAAAAT C CAAG CAAAAAT C C C T GGAT T GAAG C GC AAAGCT GAAT GA 3579 
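
Note on the alignment rows: the middle row of each Qy/Db triple marks identical positions, and it is the row most damaged by scanning in this listing. It can be recomputed from the two sequence rows. The sketch below does that for the last block of this result; the helper name `match_line` is hypothetical, and treating a gap character as a non-match is an assumption.

```python
def match_line(query: str, subject: str) -> str:
    """Return '|' where the two aligned rows agree and ' ' elsewhere.

    Both rows must already be aligned (same length, gaps written as '-').
    A gap is never counted as a match (assumption).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return "".join(
        "|" if q == s and q != "-" else " "
        for q, s in zip(query, subject)
    )


# Qy 2811..2870 and Db 3520..3579, with OCR spacing removed.
qy = "AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA"
db = "AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA"

line = match_line(qy, db)
print(qy)
print(line)
print(db)
```

For this block the two rows differ at a single column (T in the query, C in the database entry), so the recomputed row contains one blank.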

RESULT 10 
ABK90134 

ID ABK90134 standard; DNA; 3579 BP. 
XX 

AC ABK90134; 
XX 



DT 21-OCT-2002 (first entry) 
XX 

DE DNA encoding human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

KW neuroblastoma; hyperproliferative disorder; dysproliferative disorder; 

KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 

KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 

KW Nogo-associated disease; metastasis; gene; ds. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..3579 

FT /*tag= a 

FT /product= "Human NogoA protein" 
XX 

PN WO200257483-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB000228. 
XX 

PR 18-JAN-2001; 2001GB-00001312. 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 

DR P-PSDB; ABG30938. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysproliferative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides. 

XX 

PS Disclosure; Page 53-58; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperproliferative or dysproliferative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 



CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present nucleic acid sequence encodes the human NogoA 

CC protein of the invention 
XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 U; 0 Other; 
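
Note on the SQ line: the per-base counts should add up to the stated sequence length, which gives a quick integrity check on a scanned record. The sketch below parses the line above and verifies the sum. The function name `parse_sq` and the regular expressions are illustrative assumptions, not part of any database tool.

```python
import re

SQ_LINE = "SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 U; 0 Other;"


def parse_sq(line: str):
    """Return (stated_length, {base: count}) parsed from an SQ line."""
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    counts = {
        base: int(number)
        for number, base in re.findall(r"(\d+)\s+(A|C|G|T|U|Other);", line)
    }
    return length, counts


length, counts = parse_sq(SQ_LINE)
# The composition is consistent when the counts sum to the stated length.
print(length, sum(counts.values()), sum(counts.values()) == length)
```

Here 1074 + 803 + 812 + 890 equals 3579, so the composition agrees with the stated length.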

Query Match 58.0%; Score 2223.6; DB 6; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I I I I I IN I I I I I I I I I I I I I II I I I I I MM II II 
Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

Ml I li M I I I II II II II I I I I II II I I I I I II I I I I I I I II I II II 
Db 772 GT AT T ACC C AC T GAAGGAAC AC T T C AAGAAAAT GT C AGT GAAG CT T C T AAAGAGGT CT C A 831 

Qy 121 GAGAAGGCAAAAAAT C CAT T T GT AGAGAGAAAT T TAAC AGAAT T T T C AGAAT T GGAAT AT 18 0 

I II I I I I I II I II II I I MM III I II I I I I II I I II I I I I II II I I II I 
Db 832 GAGAAG GCAAAAACT CT ACT C AT AGAT AGAGAT T T AAC AGAGT T T T C AGAAT T AGAAT AC 891 

Qy 181 T CAGAAAT GGAAT CAT CAT T CAGT GGCT CT CAAAAGGCAGAACCT GCCGTAACAGTAGC G 24 0 

M II I I I II I MUM II I II I I I II II III I I I I I I I I II II II I I I I I I I 

Db 8 92 T CAGAAAT GGGAT CAT C GT T CAGT GT C T C T C CAAAAGC AGAAT C T GC C GT AAT AGT AGC A 951 

Qy 241 AAT CCTAGGGAC GAAATAGT T GT GAGGAGT AGAGAT AAAGAAGAGGACTTAGTTAGT CTT 300 

I II I II I I I I I II I I II I I II I I II I II I I II I I I I I I I I M I I I I I I 
Db 952 AAT C CT AG G GAAGAAATAAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGT AAT 1011 

Qy 301 AACAT CCTT CAT ACTCAGCAGGAGTTAT CT ACAGT CCTTACGAAAT CAGTT GAAGAAGAA 360 

I I I I I I I II I I I I III M II I I I I I I II I I I I I I I MM III MM M 

Db 1012 AACAT CCTT CAT AAT CAACAAGAGT T AC C T AC AGC T CT T ACT AAAT T GGTT AAAGAG GAT 1071 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

II MM I I I M I I I I I I M I I I I I I I II I I II II I I I II I I I I I M I I I 

Db 1072 GA AGT T GT GT C T T C AGAAAAAG C AAAAGAC AGT T T T AAT G AAAAG AG AGT T GC AGT G 112 8 

Qy 421 GAAG CT T C TAT GG GGGAGGAAT AT GC AGACT T CAAACC AT T T GAG C GAG TAT GGGAAGT G 4 80 

I I I I II I II I I I I II II I I I I I I I I II I I II I II I I I I I I I I I I I II I I II II I I II I 
Db 112 9 GAAG CT C C TAT GAGGGAG GAAT AT GC AGACT T C AAACCAT T T GAG C GAGT AT GGGAAGT G 118 8 

Qy 481 AAAGAT ACT TACAAGCAAGATAGT GAT GTTTT GAT T GCT GGAGGTAATAT AGAGAGCAAA 54 0 

I II I I I I I III I I II I II I I I I I III I I I II I I I I I II II I II II I I I 
Db 1189 AAAGATAGT AAGGAAGATAGT GAT AT GT T GGCT GCT GGAGGTAAAAT C GAGAGCAAC 1245 

Qy 541 TT GGAAGGTAAAGT GGATAAGAAACACT TT T CAGATAGC CT T GAACAAACAAAT C GT GAA 600 

I I I I II I I I II I I I I I I I I III III II II I I II I II I I II I I I II I I Ml 

Db 1246 T T G GAAAGT AAAGT GG AT AAAAAAT GT T T T G CAGAT AG CCTT GAGCAAACT AAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GAC AC T T CAT T T C C CAGT AC AC C AGAAG C T GT AAGA 660 

II II II I II I I M II I I I I I I I II I I I II II I I I I I I II I I I II I I I III 

Db 1306 AAAGATAGT GAG AGT AGT AAT GAT GAT AC TTCTTTCCC CAGT AC G C C AGAAG GT AT AAAG 1365 



Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC---CAACAACTGAGAATGTT 717 



Db 1366 GAT C GT C C AG GAG CAT AT AT C AC AT GTGCTCCCTT T AAC C C AG C AG C AAC T GAGAG CAT T 1425 

Qy 718 T C AAC AAAC AT TTTTCCCTTGTTG GAAG AT CAT AC T T C G GAAAAT AAG AC AG AT G AAAAA 777 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 1426 GCAAC AAAC AT TTTTCCTTT GT T AG G AGAT C CT ACT T C AGAAAAT AAGAC C GAT GAAAAA 1485 

Qy 778 AAGAT AGAA- AAAAAAAGGC ACAAAT T GT AACAGAGAAGAAT G CAAGT GT CAAGACAT C A 8 36 

II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II III I I I I I I 
Db 148 6 AAAAT AGAAGAAAAGAAG G C C CAAAT AGT AAC AGAGAAGAAT AC T AGC AC CAAAACAT C A 1545 

Qy 837 AACCCTTT CCT TAT GGCAGCAC AGGAGT CTAAGACAGATT ACGTTACAACAGAT CAT GT G 896 

I I 1 I I I I I III I I I I I M I I I I I III I I I I I II I I I II I I I I I I I I I II I 
Db 1546 AAC CCTTTTCTT GT AG C AGC AC AGGAT T CT GAGACAGAT TAT GT CACAAC AGATAAT T T A 1605 

Qy 897 T CAAAGGT GAC C GAGGAAGTAGT GG CAAAC AT GC CT GAAG GT C T AAC C C CAGAT T T GGT T 956 

I I I I I I I I I I I I I II II I I II I I I I I I I I I I I I I I I I I II II I I I I I I II II 
Db 1606 ACAAAGGT GACT GAGGAAGT C GT GG CAAAC AT GC CT GAAGGC CT GACT C CAGAT T T AGT A 1665 

Qy 957 CAGGAAGC AT GT GAAAGT GAATT GAAT GAAGCTACT GGTACAAAAATT GC CT TTGAAACA 1016 

I I I I I I I II I I I I I I I i I II I I I I I I I I I I I II I I I I I I I I I I Mill i I I I I I I I 
Db 1666 CAGGAAGCAT GT GAAAGT GAATT GAAT GAAGTT ACT GGT ACAAAGATT GCTT AT GAAACA 1725 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I I I I I I I I I I I I I I I I I I I I I I II I I I MM I II I II II II I I I I I I I I I I 
Db 172 6 AAAAT GGAC T T G GT T CAAAC AT C AGAAGTT AT GCAAGAGT C ACT CT AT C C T G C AG CAC AG 1785 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I M I I I M I M I M II I II I I M I I I I I I I I I I I I I I I I M I I I I I I I II I I 

Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I M I I I II I I I I I I Ml 
Db 184 6 AT GGAAG CAC C ATT GAAT T CT G CAGTT C CT AGT GCT GGT G CT T C C GT GAT ACAGC C C AG C 1905 

Qy 1197 T CAT CAC CAT T AGAAACT CT T C CT T C AGTTAAT TAT GAAAGCATAAAGT T T GAGC CT GAA 1256 

I I I I I M II I I I I I I I I I I I II I M M I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1906 T CAT CAC CAT T AGAAGC TT CTT CAGTTAATT AT GAAAGCATAAAACAT GAGCCT GAA 1962 

Qy 1257 AAT C C C C CAC CAT AT GAGGAGG C CAT GAAT GT AT CAC T AAAAAAAGAAT CAGGAAT GAAT 1316 

M I II I II I I I I I I I I I I I I I I M I I I I I I II I II II I I II I I I I I I I I I II II 
Db 1963 AAC C C C C CAC CAT AT GAAG AGGC CAT GAGT GT AT CACT AAAAAAAGT AT CAGGAAT AAAG 2022 

Qy 1317 GAAGAAAT CACAGAGC CT GAAGGTATT AGT GT AGCT GTT CAGGAAACAGAAGCT CCTT AT 1376 

M II I I I I I I I I I I I II I I I I I I I II II I I II II I II I I I II M I I I I I II I 
Db 202 3 GAAGAAAT TAAAGAGC CT GAAAAT ATTAAT GCAGCT CTT CAAGAAACAGAAGCT CCTTAT 2 082 

Qy 1377 AT AT CT ATT GCAT GT GATTTAATTAAAGAAACAAAGAT CT CTACT GAACCGACT CCAGAT 1436 

II II II II I I II I I I II II I I I I I I I I I I I I I I I II I III I I II I I I II I I III 

Db 2083 AT AT C TAT T G CAT GT GAT T T AAT T AAAGAAAC AAAG CTTTCTGCT GAAC C AG C T C C G GAT 2142 

Qy 1437 T T CT C T AGT TAT T CAGAAAT AGCAGAAGTT GCAC AGC C AGT G C C C GAG CAT T CT G AGCT A 1496 

MUM II II II I I I II I III I I I I I I I I I I II II I I II II I I I I I I II I I I I 
Db 2143 T T CT C T GAT TAT T CAGAAAT G GCAAAAGTT GAAC AGC C AGT G C CT GAT CAT T CT GAGC T A 2202 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T C T GAAC C AGT T GACT TAT T T AGT GAT GAT T C AAT AC C C 1556 

I I I I I I I II I II I I II I I I I I I I II I I I I I II I I II II II II II I I II II I I I M II 



Db 2203 GT T GAAGAT T C C T CAC C T GATT CT GAAC C AGT T GACT T AT T T AGT GAT GAT T C AATAC C T 2262 

Qy 1557 GAAGT T C C AC AAAAACAAG AT GAAGC T GT AATAC T T GT GAAAGAAAAC CT CAC T GAAAT T 1616 

II I I II I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I 
Db 2263 GAC GT T C C ACAAAAACAAGAT GAAACT GT GAT G CTT GT GAAAGAAAGT C T C ACT GAGAC T 2322 

Qy 1617 T CAT C T GAGT CAAT G AC AGGAC AT GACAAT AAGG GAAAAC T C AGT G CT T CAC CAT CAC CT 1676 

I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2323 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAACT C AGT GC T T T G C CACCT 2379 

Qy 1677 GAG G GAGGAAAAC C GT AT T T GGAGT CT TT T C AG C CC AGT T T AGG CAT C AC AAAAGAT AC C 1736 

I I I I I I I I I I I II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 
Db 238 0 GAGGGAGGAAAG C CAT AT T T GGAAT CT TT TAAGC T C AGT T T AGAT AAC ACAAAAGAT AC C 2 439 

Qy 1737 T T AGC AC CT GAT GAAGT T T C AG CAT T GAC C CAAAAG GAGAAAAT C C C T T T G C AGAT GGAG 1796 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 244 0 C T GT T AC CT GAT GAAGT T T CAAC AT T GAGCAAAAAG GAGAAAAT T C C T T T G CAGAT G GAG 24 99 

Qy 1797 GAG C T CAAT ACT G C AGT T TAT T CAAGT GAT GGC T TAT T CAT T G C T C AG GAAGCAAAC C T A 1856 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I III II I I I I I I I I I II 
Db 2500 GAGCT CAGT ACT GC AGT T TAT T CAAAT GAT GAC T TAT T T AT TT CTAAGGAAG C ACAG AT A 2559 

Qy 1857 AG AG AAAG T G AAAC AT T T T CAGAT T CAT C T C C GAT T GAG AT TAT AG AT GAG T T C C C GAC C 1916 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I II 

Db 2560 AGAGAAACT GAAAC GTT T T CAGATT CAT CT C CAAT T GAAAT TAT AG AT GAGTT CC CT ACA 2 619 

Qy 1917 T T T GT CAGT T C T AAAG CAGAT TCTTCTCC T AC AT TAG C C AG G G AATAC AC T GAC C T AG AA 197 6 

II I I I I I I I I I I I I I I I I I I I III I I I I M I I I I I I I I I I I I I II I I I I II 

Db 2 62 0 T T GAT C AGTT CT AAAACT GAT T CAT T T T CTAAAT T AGC C AG GGAAT AT ACT GAC CT AGAA 2679 

Qy 1977 GT AG C C CACAAAAGT GAAAT T GCT GAC AT C C AG GAT G GAGC T GGGT C ATT GGC T T GT GC A 2036 

III I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I II I II 

Db 2680 GT AT C C CACAAAAGT GAAAT T G CT AAT G C C C C G GAT G GAGCT GG GT C ATT G C CT T G CAC A 2739 

Qy 2037 GGAT T GC C CC AT GAC CT T T C T T T CAAGAGT AT ACAAC C TAAAGAG GAAGT T CAT 2090 

I M I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I Mill II I I 

Db 274 0 GAAT T GC C C CAT GAC CT T T CT T T GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT CAGT 2 7 99 

Qy 2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I II I I II I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 
Db 2800 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GAT GTTTCTGCTTTG GAT GCT CAAG C AGAGAT AGGC AGC AT AGAAAAAC C CAAAGT T CT T 2210 

I I II I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2860 GAT GTTTCTGCTTT G GC CAC T CAAGC AGAGAT AGAGAGC AT AGT T AAAC C C AAAGT T CTT 2919 

Qy 2211 GT GAAAGAAGC C GAGAGAAAACTT CCT T CT GATACAGAAAAAGAGCGAAGAT CT C CAT CT 2270 

I I II I I I I I 1 I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 2 92 0 GTGAAAGAAGCTGAGAAAAAACTT CCTTCCGATACAGAAAAAGAGGACAGATCACCAT CT 2 979 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2980 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 



Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

        |||||||||||||| ||||| |||||||||||||||||||||||||||||||| ||||||
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

        ||||||||||| |||||||||||||| |||||||||||||| ||||||||||||||||||
Db 3160 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

        ||||||||| |||||||||||||||||||||||||||||||||||||||||||||| |||
Db 3220 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3279 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

        ||||||||||||||||| |||||||| ||||| |||||||| ||||||||||||||||||
Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

        ||||| ||||||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

        ||||||||||||||||||||||| |||||||||||||| |||||||||||||||||||||
Db 3460 ATTTATGAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3519 

Qy 2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2870 

        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 3520 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3579 

RESULT 11 
ABN86601 

ID ABN86601 standard; DNA; 3579 BP. 
XX 

AC ABN86601; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo encoding DNA. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 
KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 
KW nootropic; antiparkinsonian; ophthalmological; analgesic; hepatotropic; 
KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 
KW neurotransmitter receptor; human; gene; ds. 
XX 

OS Homo sapiens.
XX 

FH Key Location/Qualifiers 

FT CDS 1..3579
FT /*tag= a
FT /product= "Nogo"
FT /note= "Nogo-A, Nogo-B and Nogo-C"

XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-00893348.
XX

PR 19-MAY-1998; 98IL-00124500.
PR 21-JUL-1998; 98WO-US014715.
PR 22-DEC-1998; 98US-00218277.
PR 19-MAY-1999; 99US-00314161.
XX 

PA (YEDA ) YEDA RES & DEV CO LTD.
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR P-PSDB; ABB81078, ABB81079, ABB81080. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the
PT central/peripheral nervous system from injury/disease, comprises
PT administering nervous system-specific activated T cells/antigen, or
PT analogs/peptides.

XX 

PS Disclosure; Page 49-53; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring
CC neuroprotection and preventing or inhibiting neuronal degeneration in the
CC central/peripheral nervous system (NS). The method involves administering
CC NS-specific activated T cells, NS-specific antigen, its analogue or its
CC peptide, a nucleotide sequence encoding the NS-specific antigen or its
CC analogue, or combinations. The method is useful for promoting nerve
CC regeneration and preventing neuronal degeneration in central/peripheral
CC nervous system from injury/disease, where the injury is spinal cord
CC injury, blunt trauma, penetrating trauma, hemorrhagic stroke, ischaemic
CC stroke or damages caused by surgery such as tumour excision. The disease
CC is not an autoimmune disease or neoplasm. The disease results in a
CC degenerative process occurring in either gray or white matter or both.
CC The disease is diabetic neuropathy, senile dementia, Alzheimer's disease,
CC Parkinson's disease, facial nerve (Bell's) palsy, glaucoma, Huntington's
CC chorea, amyotrophic lateral sclerosis, non-arteritic optic neuropathy,
CC and vitamin deficiency, intervertebral disc herniation, prion diseases
CC such as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral
CC neuropathies associated with various diseases, including but not limited
CC to uremia, porphyria, hypoglycemia, Sjogren-Larsson syndrome, acute
CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary
CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption
CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG
CC gammapathies, complications of various drugs (e.g., metronidazole) and
CC toxins (e.g., alcohol or organophosphates), Charcot-Marie-Tooth disease,
CC ataxia telangiectasia, Friedreich's ataxia, amyloid polyneuropathies,
CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's
CC disease, or lipoproteinemia. The present sequence represents a DNA
CC encoding the human neurotransmitter receptor protein Nogo (Nogo-A, Nogo-B
CC and Nogo-C), an example of NS-specific antigen



XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 U; 0 Other; 



Query Match 58.0%; Score 2223.6; DB 6; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
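
The summary figures above can be cross-checked with simple arithmetic. The sketch below assumes that "Best Local Similarity" is the number of identical matches divided by the total number of aligned columns (matches + conservative + mismatches + indels). That definition is an assumption that happens to reproduce the reported 87.5%; it is not taken from the search program's documentation, and the function name `local_similarity` is hypothetical.

```python
def local_similarity(matches: int, mismatches: int, indels: int,
                     conservative: int = 0) -> float:
    """Percent of aligned columns that are identical matches.

    Assumed definition: matches / (matches + conservative + mismatches + indels).
    """
    columns = matches + conservative + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / columns


# Figures reported for this result: Matches 2519; Mismatches 339; Indels 22.
print(f"{local_similarity(2519, 339, 22):.1f}%")
```

Under this assumption the alignment spans 2880 columns, of which 2519 are identical.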

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I II I I I I II II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

III I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II I I I I I I I I 

Db 772 GT ATT AC CCACT GAAGGAACACTT CAAGAAAAT GT CAGT GAAGCTT CTAAAGAGGT CT C A 831 

Qy 121 GAGAAGGCAAAAAATCCATTT GT AGAGAGAAATT TAACAGAATTTT C AGAATT GGAAT AT 180 

I I I I I I I I I I I II II I I I I I I III I I I I I I I II I I I I I I I II I I I I I I II 
Db 832 GAGAAG G CAAAAAC T CT ACT C AT AGAT AGAGAT T T AAC AGAGT TT T CAGAAT T AGAAT AC 8 91 

Qy 181 T C AGAAAT GGAAT CAT C ATT CAGT GGCT CT CAAAAGGC AGAAC CT G C C GT AACAGTAG C G 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I II I I I II I I I I I I 

Db 8 92 T C AGAAAT GGGAT CAT CGTT CAGT GT CT CT C CAAAAG CAGAAT CT G C C GT AAT AGT AGC A 951 

Qy 241 AAT C C T AG GGAC GAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAGGAC T T AGT T AGT CT T 3 00 

II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I 

Db 952 AAT C C T AGGG AAGAAAT AAT C GT GAAAAAT AAAGAT GAAG AAGAGAAGT T AGT T AGT AAT 1011 

Qy 301 AAC AT C C TT CAT ACT CAGCAGGAGT TAT CT AC AGT C CT T AC GAAAT C AGT T GAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I III I I I I II 

Db 1012 AACAT CCTTCATAATCAACAAGAGTTACCTACAGCT CTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 361 GAT AGAGT T CT GT C T C CAGAAAAAACAAAG GAC AGT T T TAAGGAAAAGGGAGT T GCAG C A 420 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINI I I I I I I I I I 

Db 1072 GA AGT T GT GT C TT CAGAAAAAGCAAAAGAC AGT T T TAAT GAAAAGAGAGT T GCAGT G 112 8 

Qy 421 GAAGCT T CT AT G G G GGAGGAAT AT GC AGAC T T CAAAC CAT T T GAGC GAGT AT G GGAAGT G 4 80 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1129 GAAGCT C C TAT GAG GGAGGAAT AT G C AGAC T T CAAAC CAT T T GAGC GAGT AT G GGAAGT G 1188 

Qy 4 81 AAAGAT ACT TACAAGCAAGAT AGT GAT GT T T T GAT T GCT G GAG GT AAT AT AGAGAGC AAA 540 

I I I I I I I I III I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I I I I I 
Db 1189 AAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCTG GAG GT AAAAT C GAGAGC AAC 124 5 

Qy 541 T T GGAAGGT AAAGT GGAT AAGAAAC ACT T T T C AGAT AGC C T T GAACAAACAAAT C GT GAA 600 

I I I I I I I I I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1246 T T G GAAAGT AAAGT GGAT AAAAAAT GT T T T G C AGAT AG C C T T GAGCAAACT AAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGCAGT AAT GAT GAC AC T T CAT T T C C CAGT ACAC C AGAAG C T GT AAGA 660 

I I M I I I II I I II I I I I I I II I II I I I I I II I I I I I I I I I I I I I I I I Ml 
Db 1306 AAAGATAGT GAGAGT AGT AAT GAT GAT AC TTCTTTCCC CAGT AC G C C AGAAG GT AT AAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAAC AACT GAGAAT GT T 717 

I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M 
Db 13 66 GAT C GT C C AGGAGC AT AT AT CAC AT GTGCTCCCTT T AAC C CAG C AG C AACT GAGAGC AT T 142 5 

Qy 718 T C AACAAAC AT TTTTCCCTT GT T G GAAGAT CAT AC T T C GGAAAATAAGACAGAT GAAAAA 777 

I I I I I I I I II I I I II I Mill I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 



Db 1426 GCAACAAAC AT T T T T C CT TT GT T AG GAGAT C C T AC T T C AGAAAAT AAG AC C GAT GAAAAA 1485 

Qy 77 8 AAGAT AGAA- AAAAAAAG GC AC AAAT T GT AAC AGAGAAGAAT G CAAGT GT CAAGAC AT C A 836 

M I I I I I I I I M II I I I II I I I I I I I I I I II I II III I I I I I I 

Db 1486 AAAATAGAAGAAAAGAAGGCC CAAATAGTAACAGAGAAGAATACT AGCAC CAAAACAT CA 154 5 

Qy 837 AACC C T T T C CT TAT G G C AGCAC AGGAGT CT AAGAC AGAT T AC GT T ACAAC AG AT CAT GT G 896 

I I I I I I I I Ml I I I I I I I I I I I I III II I I II I II I II I I I I I I I I I II I 
Db 154 6 AAC CCTTTTCTT GT AGC AGC ACAG GAT T CT GAGAC AGAT TAT GT C AC AAC AGAT AAT T T A 1605 

Qy 897 T CAAAG GT GAC C GAG GAAGT AGT GG CAAAC AT G C C T GAAG GT C T AAC C C C AGAT T T G GT T 956 

I I I I I I I I I I I II i I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAGC T ACT GGT ACAAAAAT T GC C T T T GAAAC A 1016 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I II 
Db 1666 C AGGAAG CAT GT GAAAGT GAAT T GAAT GAAGT T AC T G GT ACAAAGAT T G CT T AT GAAAC A 1725 

Qy 1017 AAAAT GGAC CT GGT T CAAAC TT CAGAAGC T GT G C AG GAGT C ACT T T AC C C T GTAACAC AG 1076 

I I I I I I I I I I M I I I I I I I I I I I I I I I MM I I II II I I II I I I I I Mill 

Db 1726 AAAAT GGACT T GGT T CAAAC AT C AGAAGT TAT GCAAGAGT CAC T CT AT C CT G C AGCAC AG 17 85 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

II I I I I I II I I I I I II I II II II I II II II M II II I I I I II II I II II I I II I 

Db 17 8 6 CT T T GC C CAT CAT T T GAAGAGT CAGAAGC TACT C CT T CAC CAGT T T T G C CT GAC AT T GT T 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196

I I II I I II M I I I I I M II I I I II I II II M I I I II II II M I I I II I III 

Db 1846 AT GGAAG CAC CAT T GAAT T CT G CAGT T C C T AGT GCTGGTGCTTCCGT GAT AC AGC C C AG C 1905 

Qy 1197 T CAT CAC CAT T AGAAAC TCTTCCTT C AGTT AAT TAT GAAAG C ATAAAGT T T GAG C CT GAA 1256 

II I I I I I I I II I I I I I I I I I I II I I II I M I II II II M I II II II I I I I I I 

Db 1906 T CAT CAC CAT T AGAAG C TTCTTCAGTTAATTAT GAAAGCATAAAACAT GAGCCTGAA 1962 

Qy 1257 AAT C CC C CAC CAT AT GAG GAGGC CAT GAAT GT AT C ACT AAAAAAAGAAT CAG GAAT GAAT 1316 

II I I M I II I II I I II II I I II II I I I I II M I I I II I II I II II I I I II I I II 
Db 1963 AAC C CCCCACCATAT GAAGAGGCCAT GAGTGTAT CACTAAAAAAAGTAT CAGGAATAAAG 2 022 

Qy 1317 GAAGAAAT C AC AGAGC CT GAAG GT ATT AGT GTAGCT GT T CAG GAAAC AGAAG C T C C TT AT 137 6 

I M I I I I I I II I I II I I I I I II I I II I II I I II I I I M I II I I I I II I I I I I 
Db 2023 GAAGAAAT T AAAGAGC C T GAAAAT AT T AAT GCAG CT CT T CAAGAAAC AGAAGCT C CTT AT 2 082 

Qy 1377 AT AT CTATT GCAT GT GATTTAATTAAAGAAACAAAGAT CT CTACT GAAC CGACT C CAGAT 1436 

I II I II II II I M I I I I I II I I I I II II I I I II I II I III I I II II I I I I I III 
Db 2083 AT AT CT AT T GCAT GT GAT T T AATT AAAGAAAC AAAG CT T T CT G C T GAAC C AGCT C C GG AT 2142 

Qy 1437 T T CT CT AGT T AT T C AGAAAT AGCAGAAGT T G C ACAGC CAGT G C C C GAG CAT T C T GAGC T A 1496 

M II I I I I I I I I I II II I III I I II I I I I I II I M li I I II I I I I I I M I I I I 
Db 2143 T T CT CT GAT TAT T C AGAAAT G G C AAAAGT T GAACAGC C AGT GC C T GAT CAT T C T GAGC T A 2202 

Qy 14 97 GTT GAAGAT T CCT CC C CCGATT CT GAAC CAGTT GACTTAT TTAGT GAT GATT CAATAC CC 1556 

I II I M I II I II I I II I I I I I I II I I I I I I II I I M I I II I I I I I I I I I I I I II I I I 

Db 22 03 GTT GAAGAT T C C T CAC C T GAT T CT GAAC CAGTT GACT TAT TTAGT GAT GAT T CAATAC C T 22 62 

Qy 1557 GAAGT T C C AC AAAAACAAG AT GAAG CT GT AAT AC T T GT GAAAGAAAAC C T CAC T GAAAT T 1616 

II I M I I II II I I I I I II II I I I I II I II II I I II I II I I II I II I II I I I I 

Db 22 63 GAC GT T C C ACAAAAACAAG AT GAAAC T GT GAT GCT T GT GAAAGAAAGT C T C ACT GAG ACT 2322 



Qy 1617 T CAT C T GAGT CAAT GAC AGGAC AT G AC AAT AAG G GAAAACT C AGT G C T T C AC CAT C AC CT 1676 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I II I I I Mill 

Db 2323 T CAT T T GAGT CAAT GAT AGAAT AT GAAAATAAG GAAAAAC T CAGT GCT T T G C CACCT 2379 

Qy 1677 GAG G GAGGAAAAC C GT AT T T GGAGT CT T T T C AG C C CAGT TT AG GC AT C AC AAAAG AT AC C 1736 

II I II I I I II I II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

Db 23 80 GAGGGAGGAAAGC CATATTT GGAAT CT T T TAAGCT CAGTTTAGATAACACAAAAGATACC 24 39 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 244 0 C T GT T AC CT GAT GAAGT T T CAAC AT T GAG CAAAAAGGAGAAAAT T C C T T T G C AGAT GGAG 2 4 99 

Qy 1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856

I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I III II I II II I I I I II 
Db 2500 GAGCT CAGT ACT GCAGTTTATT CAAAT GATGACTT ATTTATTT CTAAGGAAGCACAGATA 2559 

Qy 1857 AGAGAAAGT GAAAC ATT TT CAGATT CAT CT CC GAT T GAGATTATAGAT GAGTT C CC GACC 1916 

I I I It I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 

Db 2560 AGAGAAACT GAAAC GT T T T CAGATT CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C C T AC A 2619 

Qy 1917 T T T GT CAGT T CTAAAGC AGATT C T T CT C C T AC AT T AGC CAGGGAAT AC AC T GAC CT AGAA 1976 

II I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2620 T T GAT CAGT T CT AAAACT GATT C AT T T T C T AAAT TAG C CAG GGAAT AT ACT GAC CTAGAA 2679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2 68 0 GT AT C C C ACAAAAGT GAAAT T GC TAAT GC C C C GGAT GGAGCT GGGT CAT T G C CT T GC ACA 2 739 

Qy 2037 GGAT T G C C C CAT GAC CT T T C TT T CAAGAGT AT ACAAC C TAAAGAGGAAGT T CAT 2090 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 2740 GAATT GC CCCAT GAC CTTT CTTT GAAGAACAT ACAACCCAAAGTT GAAGAGAAAAT CAGT 27 99 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I I I I I II I I I I I I I I I I I III I I I I I I I II I I II I I I I I I I I 
Db 2 8 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG C AGAGAT AGGC AGC AT AGAAAAAC C CAAAGT T CT T 2210 

I I I I I I I II I I I I I I I II I I 1 I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I 
Db 2 860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2 919 

Qy 2211 GT GAAAGAAG C C GAGAGAAAACT T C CT T C T GAT AC AGAAAAAGAG C GAAG AT C T C CAT C T 2270 

I I I I I I I I I I I MM II I I I I I I I I I I M I II I I I I I II I I I I I I I I MINI 

Db 2 92 0 GT GAAAGAAG CT GAGAAAAAAC TTCCTTCC GAT AC AGAAAAAGAG GAC AG AT C AC CAT C T 2 97 9 

Qy 2271 GCT AT AT T T T C AGC AGAG C T GAGT AAAAC T T C AGTT GT T GACCT C CT C TACT G GAGAGAC 2330 

I II I I I I I I II II I I I I I I I I I II II I I I I II I II I I I I I I II I I I I I I II I I I I I I I I 
Db 298 0 GCTATATTTT CAGCAGAGCT GAGT AAAACTT CAGTT GTT GACCT C CT GT ACT G GAGAGAC 3039 

Qy 2331 ATTAAGAAGACT GGAGT GGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I II I II I I I I I I I I I II I I I II I M I M I I I I I I I I I I I II II I I I I I I I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 2 391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 50 

I I II I I I II I I I II I I I II I I I I I I I I I M I I I I I I I II I I I I II I I I I I I I M I I I 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 



Qy 24 51 T T TAG GAT AT AT AAGG GT GT GAT C C AG GCT AT C C AGAAAT C T GAT GAAGG C C AC C CAT T C 2510 

I I II I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I II I I I I I I II I I I I I I I I I I 
Db 3160 T T T AGGAT AT ACAAGGGT GT GAT C CAAG C TAT C C AGAAAT C AGAT GAAG GC C AC C CAT T C 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570
        ||||||||| |||||||||||||||||||||||||||||||||||||||||||||| |||
Db 3220 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3279

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630
        ||||||||||||||||| |||||||| ||||| |||||||| ||||||||||||||||||
Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750
        ||||| ||||||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810
        ||||||||||||||||||||||| |||||||||||||| |||||||||||||||||||||
Db 3460 ATTTATGAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3519

Qy 2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2870
        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 3520 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3579
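
The match line between each Qy and Db row marks identical columns with a bar. Where a row has no indels, that line can be regenerated from the two sequences alone, which gives a useful check on a damaged printout. The sketch below is an illustration only: the helper name `midline` and the use of `-` as the gap character are assumptions, and it does not reproduce any scoring done by the search program.

```python
def midline(query: str, subject: str, gap: str = "-") -> str:
    """Return '|' for each identical, non-gap column and a space otherwise."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return "".join(
        "|" if q == s and q != gap else " "
        for q, s in zip(query.upper(), subject.upper())
    )


# Last row of the alignment above (Qy 2811-2870 against Db 3520-3579).
qy = "AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA"
db = "AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA"

line = midline(qy, db)
print(f"Qy 2811 {qy} 2870")
print(f"        {line}")
print(f"Db 3520 {db} 3579")
print(line.count("|"), "identical columns out of", len(line))
```

Rows that contain indels need the gap characters restored in both sequences before this comparison is meaningful.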
XX

PR 06-NOV-1998; 98US-0107446P.
XX 
XX

PI Schwab ME, Chen MS; 
XX 
DR P-PSDB; AAY71310.
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 26; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a cDNA encoding rat Nogo A protein which is a
CC potent neural cell growth inhibitor and is free of all central nervous
CC system (CNS) myelin material with which it is natively associated. The
CC present sequence was generated by fusing R018U37-3, R1-3U21 cDNA
CC sequences isolated from hexanucleotides-primed rat brain stem/spinal cord
CC library, and Oli18 cDNA from an oligo d(T)-primed rat oligodendrocyte
CC library. Nogo proteins and fragments displaying neurite growth inhibitory
CC activity are used in the treatment of neoplastic disease of the CNS e.g.
CC glioma, glioblastoma, medulloblastoma, craniopharyngioma, ependymoma,
CC pinealoma, haemangioblastoma, acoustic neuroma, oligodendroglioma,
CC meningioma, neuroblastoma or retinoblastoma and degenerative nerve
CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which
CC promote Nogo activity can be used to treat or prevent hyperproliferative
CC or benign dysproliferative disorders e.g. psoriasis and tissue
CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to
CC inhibit production of Nogo protein to induce regeneration of neurons or
CC to promote structural plasticity of the CNS in disorders where neurite
CC growth, regeneration or maintenance are deficient or desired. The animal
CC models can be used in diagnostic and screening methods for predisposition
CC to disorders and to screen for or test molecules which can treat or
CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are
CC referred to in claim 32 and SEQ ID NO: 29 in the disclosure of the
CC specification. However, the specification does not include sequences for
CC these SEQ ID numbers

XX 

SQ Sequence 4684 BP; 1358 A; 1048 C; 1112 G; 1166 T; 0 U; 0 Other; 



Query Match 57.0%; Score 2183.4; DB 3; Length 4684; 

Best Local Similarity 78.0%; Pred. No. 0; 

Matches 3004; Conservative 0; Mismatches 736; Indels 109; Gaps 27; 
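
The SQ line of a record states the sequence length together with per-base counts, so the two can be checked against each other. The sketch below parses a line of the form shown in this record and confirms that the counts add up to the declared length. The function name `parse_sq_line` and the regular expressions are assumptions written for this one line layout, not part of any database toolkit.

```python
import re


def parse_sq_line(line: str) -> tuple[int, dict[str, int]]:
    """Split an 'SQ Sequence ...' line into declared length and base counts."""
    length_match = re.search(r"Sequence\s+(\d+)\s+BP", line)
    if length_match is None:
        raise ValueError("not an SQ line")
    counts = {
        base: int(number)
        for number, base in re.findall(r"(\d+)\s+(A|C|G|T|U|Other);", line)
    }
    return int(length_match.group(1)), counts


sq = "SQ Sequence 4684 BP; 1358 A; 1048 C; 1112 G; 1166 T; 0 U; 0 Other;"
length, counts = parse_sq_line(sq)
print(length, counts, sum(counts.values()) == length)
```

A mismatch between the sum and the declared length would point to a damaged digit in the line.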

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

I M I I I I I I I I II I I I I I I I I I I I I I I I M || I I I I I I | | M | Ml I I I I I 

Db 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 98 7 

Qy 61 GT AC T G CC C ACT GAAGGAAC AC T T C C AG C AACT T C AAAT GAAGC T T CT AAAGC AT T C T C A 12 0 

II I I I I I I I I I I II I I I I I I I I I I I I II II 

Db 988 GT GT CAT CCT CAGAAGGAACAATT GAAGAAACTTTAAAT GAAGCT T CTAAAGAGTT GC CA 1047 



Qy 121 GAGAAGGCAAAAAAT CCATT T GT AGAGAGAAATTTAACAGAATTTT CAGAAT T GGAAT AT 180 

I I I I I I I M I I I I I I I I I I I I I I III I I I I I I II I I II I I I I I I I I I I I I I I I 
Db 1048 GAGAGGGCAACAAAT C CATT T GTAAATAGAGATTTAGCAGAATTTT CAGAATTAGAAT AT 1107 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I M | | 

Db 1108 T C AGAAAT GG GAT CAT CT TT TAAAG GC T C C C C AAAAG GAGAGT C AG C CAT AT T AGT AGAA 1167 

Qy 241 AAT C CT AG GGAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAGGAC T T AGT T AGT C T T 300 

II M I I I I I I I II I I I II I I I I I I I I I I I I I I I || I I I || | | | | | 

Db 1168 AACACTAAGGAAGAAGTAATT GT GAGGAGT AAAGAC A AAGAGGAT T T AGT T T GT AGT 1224 

Qy 301 AACAT CCTT CATACT CAGCAGGAGTT AT CTACAGTC CTTAC GAAAT CAGTT GAAGAAGAA 360 

I I I I I I I I I I I I I I I I I | | | Mill 

Db 1225 G C AG C C C T T C AC AG T C C AC AAG AAT C AC C T GTGGGTAAAGAA 12 66 

Qy 361 GAT AGAGT T CT GT CT C C AGAAAAAACAAAG GACAGT T T TAAGGAAAAGGGAGT T GCAG C A 42 0 

I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I | M I I 

Db 1267 GAC AGAGT T GT GT CT C C AGAAAAGACAAT G GACAT T T T TAAT GAAAT GCAGAT GT CAGT A 1326 

Qy 421 GAAGCT T C TAT G G GGGAG GAAT AT GC AGAC T T CAAAC CAT T T GAG C GAGT AT G GGAAGT G 480 

I Ml II II I I I I I I M I I I I I I I II II I I I I I I I I I II I I I I I I I II I 
Db 1327 GT AGCAC CT GT GAGG GAAGAGT AT GC AGAC T T T AAGC C AT T T GAACAAGC AT G GGAAGT G 1386 

Qy 481 AAAGAT ACTTACAAGCAAGAT AGT GAT GTTTT GATT GCT GGAGGTAATATAGAGAGCAAA 54 0 

I I I I I I I I I I I II I I I I I I I I I II I I I I I II I I I I 
Db 1387 AAAGAT ACTT AT GAGGGAAGT AG GGAT GTGCTGGCTGC T AGAG C TAAT 14 34 

Qy 541 T T GGAAGGTAAAGT GGATAAGAAAC AC TT T T CAGAT AG CCTT GAACAAACAAAT C GT GAA 600 

I I I I I I I I I I I I II I I III III I I | | | | || | | | | | || | | | | | | 
Db 1435 GT GGAAAGTAAAGT G GACAGAAAAT G CT T G GAAGAT AGC C T GGAGCAAAAAAGT CT T GG G 14 94 

Qy 601 AAAGAT AGT GAAAGC AGT AAT GAT GACAC T T C ATT T C C CAGT AC AC C AGAAGCT GT AAGA 660 

M I I I I I I I I I MM Mill II I I I I I I MINIM MUM MM I 

Db 1495 AAG GAT AGT GAAGGCAGAAAT GAGGAT GCTTCTTTCCC CAGT AC C C C AGAAC CT GT GAAG 1554 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 72 0 

I I Ml II II II II II MINI I I I I II I M MM II I I I 

Db 1555 GACAGCT C CAGAG CAT AT AT T AC CT GT G CTT CCTT T AC CT C AGCAAC C GAAAGC AC C AC A 1614 

Qy 721 AC AAAC AT TTTTCCCTTGTTG GAAGAT CAT AC T T C G GAAAAT AAG AC AG AT GAAAAAAAG 780 

I M I M Ml M II II I II I I I II || M II I I II I II I I I I II I I I I I II II I 
Db 1615 GCAAAC AC TTTCCCTTT GT T AGAAGAT CAT ACT T CAGAAAAT AAAACAGAT GAAAAAAAA 1674 

Qy 781 AT AGAA- AAAAAAAG GC ACAAAT T GT AACAGAGAAGAAT GCAAGT GT CAAGAC AT CAAAC 839 

M I II I II I I I I II II I I II I I I I I I I I I II I III I I I I I I II I I 
Db 1675 AT AG AAG AAAGGAAG G C C C AAAT T AT AAC AG AGAAGA CTAGCCCCAAAACGTCAAAT 1731 

Qy 84 0 C CT T T C CT T AT GGC AGC AC AGGAGT C T AAGAC AGAT T AC GT T AC AAC AGAT CAT GT GT C A 899 

M I I I I M I I I M I II I II I III II I II I I II II I I I I I I II II I III 

Db 1732 CCTTTCCTT GT AG C AGT ACAG GAT T C T GAGGCAGAT TAT GT T ACAAC AGAT AC C T TAT C A 1791 

Qy 900 AAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAG 959 

M M I I I I M I I M I I II II I II II I II II I I I I II I II II I I I II I I II I I I 
Db 1792 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 



Qy 960 GAAGCAT GT GAAAGT GAAT T GAAT GAAGCT ACT G GT ACAAAAAT T GC C T T T GAAACAAAA 1019 

I I I I I I I I I I I I I I I M I I I I I I I I I I I II I II I I I I I Mill I I I I I I I | I I I 
Db 1852 GAAGCAT GT GAAAGT GAAC T GAAT GAAG C C AC AG GT AC AAAG AT T G C T TAT GAAACAAAA 1911 

Qy 1020 AT GGAC C T GGT T CAAACT T C AGAAGC T GT G CAG GAGT C AC TT T AC C C T GT AAC ACAG C T T 1079 

Mill II I I I I I II II I I II I I I I M II I I II II I II II I II I I II II 
Db 1912 GT G GAC T T GGT C CAAAC AT C AGAAG C T AT ACAAGAAT CAC TT T AC C C C AC AG C ACAGCT T 1971 

Qy 1080 TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 1139 

M I II I I I II II I III I II II I I II II I II I I I I I II I II II I I I II I I I III 
Db 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

Qy 114 0 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 1199 

I M M II I II I II II II I I I I II II II II I II II I II I I I II II M I I I I I 

Db 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

Qy 1200 T CACCATTAGAAACT CTT CCTT CAGTTAATTAT GAAAGCATAAAGT TT GAGC CT GAAAAT 1259 

II M I I I I I I I I M I MIMI I II II I II I II I I I I I II I I I M I II I 

Db 2092 T C C C CAC T GGAAGC ACCT C C T C C AGTT AGTT AT GAC AG TAT AAAG CTT GAG C C T GAAAAC 2151 

Qy 1260 C C C C CAC CAT AT GAG GAGGC C AT GAAT GT AT CACT AAAAAAAGAAT C AGGAAT GAAT GAA 1319 

I I M I II II I I I II II II I I II I I I I II II II I II II I I II I I I M I 
Db 2152 C C C C CAC CAT AT GAAGAAG C CAT GAAT G T AG CAC T AAAAG C T T T G G GAAC AAAG GAA 2208 

Qy 1320 GAAAT C AC AGAGC CT GAAGGT AT T AGT GT AGCT GT T CAG GAAAC AGAAGCT C C T TAT AT A 137 9 

I Ml I II I I I II II I II III II II I I II II M II I I II I I I I II I II I I II I I 
Db 2209 GGAATAAAAGAG C C T GAAAGT T T TAAT G C AG CT GT T CAGGAAAC AGAAGCT C CTT AT AT A 2268 

Qy 138 0 T CT AT T GC AT GT GAT T TAAT T AAAGAAACAAAGAT C T CT ACT GAAC C GACT C C AGAT T T C 1439 

M I I II I I I II II I I I II I I II I I II I II I I MM I II II II I I I II I II I II 
Db 22 69 T C CAT T G C GT GT GAT T TAAT T AAAGAAAC AAAG C T C T C CAC T GAG C C AAGT C C AGAT T T C 2328 

Qy 14 4 0 T C T AGTT AT T CAGAAAT AGC AGAAGT T GCACAGC C AGT G C C C GAGC AT T CT GAG C TAGT T 14 99 

I II I II II I I II I II II II I II I I I I I I II II I II II II I II I I II I 

Db 232 9 T CT AATT AT T CAGAAAT AG CAAAAT T C GAGAAGT C GGT GC C C GAACAC G C T GAGC TAGT G 2388 

Qy 1500 GAAGATT CCTCCCCC GAT T CT GAAC C AGT T GACT T AT TT AGT GAT GAT T CAAT AC C C GAA 1559 

II I 1 II II M II M I I I II I I I II I II I I II I II I M I II I I M II II II Ml 

Db 2389 GAG GAT T C CT CAC CT GAAT CT GAAC CAGT T GACT TAT T TAGT GAT GAT T C GATT C CT GAA 2 44 8 

Qy 1560 GT T C C ACAAAAACAAGAT GAAG C T GTAAT ACT T GT GAAAGAAAAC CT CACT GAAAT T T C A 1619 

II I II II II I I II II II II I II II II I II I I I I II I II II II I I I 

Db 244 9 GT C C CACAAAC ACAAGAG GAG GCT GT GAT G C T CAT GAAGGAG AGT C T CACT GAAGT GT C - 2 507 

Qy 162 0 T C T GAGT CAAT GAC AGGAC AT GACAAT AAGGGAAAACT CAGT GC T T CAC CAT CAC C T GAG 1679 

I M M I II I Ml I II I II I II II I I || III 
Db 250 8 T GAGACAGT AG C C C AGC AC AAAGAG GAGAGAC T TAGT GC CT CAC CTCAGGAG 255 9 

Qy 1680 GGAGGAAAAC C GT AT TT G GAGT CT T T T CAGC C CAGT T TAG G CAT C ACAAAAGAT AC CT T A 1739 

I I I I I I II I II I I I I II I I I II I I II I II II II I II I II II II I 
Db 2560 C TAG GAAAGC C AT AT T T AGAGT CT T T T CAGC C CAAT T T AC AT AGT ACAAAAGAT G C T 2616 

Qy 174 0 GC AC CT GAT GAAGT T T CAGC AT T GAC C CAAAAG GAGAAAAT C C C T T T G C AGAT GGAG GAG 1799 

III II I II I II II I I II I I II I II II I I II II I I I I I II II II II I III 

Db 2617 GCAT C TAAT GACAT T C C AAC AT T GAC CAAAAAGGAGAAAAT T T C T T T G CAAAT G GAAGAG 2 67 6 

Qy 18 00 CT CAAT AC T GC AGT T T ATT CAAGT GAT GGCT T AT T CAT T GC T CAG GAAGCAAAC C T AAGA 1859 



Db 2677 T T T AAT ACT G CAAT T TAT T CAAAT GAT GACT T ACT T T C T T C T AAG GAAGACAAAATAAAA 2736 

Qy 1860 GAAAGT GAAACAT TTT CAGAT T CAT CT CCGATT GAGAT TATAGAT GAGTT CC C GAC CT T T 1919 

I I I M II I I I I I I I I I II I I 1 II I II II I I I I I I I I I I I I I II I I I I Ml 

Db 2737 GAAAGT GAAACAT TTT CAGAT T CAT CT C C GAT T GAGAT AAT AGAT GAAT T T C C C AC GT T T 2796 

Qy 192 0 GT C AGT T C TAAAG CAGAT T CT T CT C CT AC AT T AGC CAG G GAAT AC AC T GAC C T AGAAGT A 197 9 

I I II I I I I I I MM I I I I I I II I I I I II I I III I I I I I I II II II I II I I 

Db 2797 GT C AGT G C T AA AGAT GAT T CT C C T AAAT T AGC CAAG GAGT ACACT GAT C T AGAAGT A 2853 

Qy 1980 GC C C ACAAAAGT GAAAT T G C T GAC AT C C AGGAT G GAGC T G G GT CAT T G GC T T GT G CAGGA 2 039 

M I I I I I II I I I I I I II I I I I I I I I Mill I I I I I I I I I I III 

Db 2854 T C C GACAAAAGT GAAAT T GCT AAT AT C CAAAGC GGGG CAGAT T CAT T G C CT T GCTT AGAA 2913 

Qy 2 04 0 TT GC C C CAT GAC CT T T CT TT CAAGAGT AT ACAAC CT AAAGAGGAAGT T CAT GT C CCAGAT 2099 

MUM I I I I I II I I I II I I M I I II I I II I I I I II Mill Mill I I I II 

Db 2914 TTGCCCTGT GAC CTTTCTTT C AAGAAT AT AT AT C C T AAAGAT GAAGT AC AT GT T T CAGAT 2 973 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 2159 

M II I II I II II II I II I I I I I I I MM I I I I I I I I II I I I 

Db 2974 GAAT T CT C C GAAAAT AGGT C C AGT GT AT CT AAGGCAT C CAT AT C GC C T T CAAAT GT CT CT 3033 

Qy 2160 GCTT T G GAT GCT CAAGC AG AGAT AGGC AGC AT AGAAAAAC C CAAAGT T C T T GT GAAAGAA 2219 

M M M M I II I II I I I I II I II I M I I I I I I II 11 Ml II I I I II 
Db 3034 GCTT T GGAAC CT CAGAC AGAAAT G GGC AG CAT AGT T AAAT C CAAAT CACT T AC GAAAGAA 3093 

Qy 2220 G C C GAGAGAAAACT T C CT T CT GAT AC AGAAAAAGAGC GAAGAT C T C CAT CT G CT ATAT TT 2279 

II M M I I II I I I I II I I II I II I II II II II II I II I II I II II I I 

Db 3094 GC AGAGAAAAAACT T C CT T C T GAC AC AGAGAAAGAGGAC AGAT C C C T GT C AGC T GT AT T G 3153 

Qy 2280 T C AGC AGAGC T GAGT AAAAC T T C AGT T GT T GAC CT CCT CT AC T GGAGAGAC AT T AAGAAG 2339 

M II II I II I I I I I I M I M II I I I M I II I II M I II M I II II II I I I I M II I II I I 

Db 3154 T C AGC AGAGCT GAGTAAAAC T T C AGT T GT T GAC CT CC T CT ACT GGAGAGAC AT T AAGAAG 3213 

Qy 234 0 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 2399 

II II I II II II I I I II II I I I I I I I I I I I M I I I II I II M M I II I I I II I II II 
Db 3214 ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 3273 

Qy 2 4 00 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 245 9 

M M M M I I M I M M I I I I I II II I I I I I II II I I II I I II I II II II II II I I I I 
Db 32 7 4 GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 3333 

Qy 2460 T ATAAGGGT GT GAT CC AGGC TAT C CAGAAAT CT GAT GAAGGCCAC C CATT CAGGGCATAT 2519 

M II II II I I II II I I II I I I I II I I II M I I II I II I II I I I I I II I I II I I I I I II 
Db 3334 T AT AAGG GC GT GAT CCAG GCT AT C CAGAAAT CAGAT GAAGG C CAC C CAT T CAG G G CAT AT 3393 

Qy 252 0 T T G GAAT C T GAAGT T G CT AT AT C T GAG GAGT T GGT T C AGAAGT ACAG CAAT TCTGCTCTT 2579 

II I M II I I II I I I I II I I I I I II I I I I I I II II I I I I Mill I II I I I I I I I II 
Db 3394 T T AGAAT CT GAAGT T G CT AT AT C AGAG GAAT T G GT T CAGAAAT AC AGT AAT TCTGCTCTT 3453 

Qy 2580 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 2639 

M I I I II I II I I I I II I I I I I I II II I II II II II I II II I I I II I I II I I I I I 
Db 3454 GGT CAT GT GAAC AG C ACAAT AAAAGAAC T GAGG C G G C T T T T CT T AGT T GAT GAT T T AGT T 3513 

Qy 264 0 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2699 

I I I I I I M M M I I I I I I I I I || I I II I II I I II II II I II I II I I I I II I I II I I I 



Db 3514 GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 3573 

Qy 27 00 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 2759 

I I I I I I I I I I I I I I I I I II I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I 
Db 3574 GGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAA 3633 

Qy 2760 C G G CAT CAG GC G CAAAT AGAT CAT TAT CT G G GACT T G CAAAT AAGAAT GT TAAAG AT GCT 2819 

I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 3634 C GGCAT C AGGT GC AGAT AGAT CAT TAT CT AG GACT T G C AAACAAGAGT GT T AAG GAT GC C 3693 

Qy 282 0 AT GG CT AAAAT C CAAGCAAAAAT C C CT G GAT T GAAG C GTAAAGC T GAAT GAGAAAGC C T G 2 87 9 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I Mill II Ml I I II I I 
Db 3694 AT G G C CAAAAT C CAAGCAAAAAT C C CT GGAT T GAAGC GCAAAGC AGAT T GAAAAAGC C C C 3753 

Qy 2880 AAAGAGT T AAC AAT AGAGGAGT T TAT CT T T AAAGGGGAT AT T CAT T T GAT T C CAT T GG G G 2 93 9 

• I I I M I I II M I I II II I I II II I III I I II I I I Ml 
Db 3754 AA ACAGAAGT T C AT CT T T AAAGG G GAC ACT CACT T GAT T AC GGG 3797 

Qy 2 94 0 AGGGT C AGG GAAGAACAAAGC C T T GAC AT T G C AGT GC AGT T T C ACAGAT C T TT AT T T T T A 2999 

II I I II I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 3798 GGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT--CAGCTCTTTATTTTTA 3855 

Qy 3000 GCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAG 3059 

III Ml III I I I I I II I II I II I II I I I I I I I I II II I I I I I I I I I M II II 
Db 3856 GCAGTGCACTGTTTGAGGAAAAATTACCTGTCTTGACTT-CCTGTGTTTATCATCTTAAG 3914 

Qy 3060 TAT T GT AAG CT GCT AT GT AT GGAT T TAAAT C GTAAT CAT AT T T GT T T T T C CT GT AT GAG G 3119 

I I I I I I I I I I I II I M I I I I I I I I I I I II I II I I I I I II II I 
Db 3915 TATTGTAAGCTGCTGTGTATGGATCT-CATTGTAGTCACACTTGTCTTCCCCAATGAGGC 3973 

Qy 312 0 C ACT GGT GAATAAACAAAGAT CT GAGAAAG CT GT AT AT T ACAC T T T GT C G CAG GT AGT C T 317 9 

II I I I I I II II I I II I II I II I M I III I I I I I I II I II 

Db 3974 GCCT GGT GAAT AAAGGA — CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCT 4 031 

Qy 3180 T GC T GT AT T T G GGGAAT T GCAAAGAAAGT GGAG CT GAC AGAAAT AAC C CT T T T CAC A 3236 

I I II II I I I I I I I I I I II I I III I I I I I I I I I I I II M II I 
Db 4032 AGCT GTAT GCAGAGAGTTGTAAAGAAGGCAAAT CT GGGGGCAGGGAAAAC C CT T TT CACA 4091 

Qy 3237 GT T T GT G CACT GT GT AC GGT CT GT GT AG GT T GAT G CAGAT T T T CT GAAAT GAAAT GTT T A 3296 

I I I I I II I I III I I I II I II I I II I II II II II II 

Db 4092 G — TGTACTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTTCTGAAATGAAATGTTTA 4149 

Qy 32 97 GAC GAGAT CAT G C CAC CAAG G CAG GAGT GAAAAAGCT T GCCTTTCCT GGT AT GT T C T AGG 3356 

II I I I I I I I I I I I I I I II Mill II I II I I I I I I I II I I I I 

Db 4150 GATGAGAGCATACTACTAAAGCAGAGTGGAAAACTCTGTC — TTTATGGTGTGTTCTAGG 4207 

Qy 3357 T GT ATT GT GAAATTTACTGTT GTATTAATT GCCAATATAAGTAAAT ATAGATT ATAT AT A 3416 

I I I I I I I I I I I II I I I I I I I I I I I I I II II II II II I I II I I I I I I 
Db 42 08 TGTATTGTG-AATTTACTGTTATATT GCCAATATAAGTAAATATAGA CCTAA 4258 

Qy 3417 TCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTT 3476 

I I I I I I I I II II I II II I I I I II II I I II I I I I I I I I I I II I I I I I I I I I I I I 

Db 4259 TCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTT 4318 

Qy 3477 GATACTT CTGTCAT GGGTTTTATGT GT GTAGT CCCAAAGCACATAAGCTAGGGAGAAACG 3536 

II I I I I I I I M I I M I MM I II I I I I 

Db 4319 GAC C T CT GAGT CAT T G GT T AT G C AGT GTAGT C C CAAG C AC AT AAAC TAGGAAG AGAAAT G 4378 



Qy 3537 TACTTCTAGGCGCACTACC-ATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACT 3595 

I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I 

Db 4379 TAT T T GT AGGAGT G C T AC CT AC C AC C T GT T T T CAAGAAAAT AT AGAAC T C C AACAAAAAT 44 38 

Qy 3596 C C T CAAC AT AAAC T T C AC T G C AC AGACT T AC T GT AGT TAAT T T TAT C AC AAAC T C 3650 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 439 AT AGAAT GT CAT T T CAAAGAC T TACT GT AT GT AT AGT TAAT T T T GT C AC AGAC T C T GAAA 44 98 

Qy 3651 T G GAC T GAAT CT AAT GC T T C CAAAAAT GT T T GCAAAT AT C AAACAT T GT TAT GT A 3705 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 44 99 T T CT AT GGACT GAAT T T CAT GC T T C C — AAAT GT T T G C AGT TAT CAAAC AT T GT TAT GC A 4 556 

Qy 37 06 AGAAAAT AT - AAAT GAC GAT T T AT ACAAT T GT GGT TT AAGCT GT AT T GAACT AAAT CT GT 37 64 

M I I I II I I I I I I II MINI I I I I I I I I I I I I I I III M II I I I I I I I 
Db 4557 AG AAAT C AT AAAAT GAAG AC T TAT AC CAT T G T G GT T T AAG C C G T AC T GAAT T — ATCTGT 4614 

Qy 3765 GGAAT GCATT GT GAACT GTAAAAGCAAAGT AT CAATAAAGCTTATAGACTTAAAAAAAAA 382 4 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 4615 GGAAT GCATT GT GAACT GTAAAAGCAAAGT AT CAATAAAGCTTATAGACTTAAAAAAAAA 4674 

Qy 3825 AAAAAAAAA 3833 

I I I I I I I I I 
Db 4675 AAAAAAAAA 4683 

RESULT 13 
ABN86600 



ID ABN86600 standard; DNA; 4684 BP. 
XX 

AC ABN86600; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo encoding DNA. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological; analgesic; hepatotropic;

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; gene; ds.
XX 

OS Rattus norvegicus. 
XX 

FH Key Location/Qualifiers 

FT CDS 253. .3744 

FT /*tag= a 

FT /product= "Nogo-A"

XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-00893348.
XX

PR 19-MAY-1998; 98IL-00124500.

PR 21-JUL-1998; 98WO-US014715.

PR 22-DEC-1998; 98US-00218277.

PR 19-MAY-1999; 99US-00314161.
XX 

PA (YEDA) YEDA RES & DEV CO LTD.
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR P-PSDB; ABB81074, ABB81076, ABB81077. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides.

XX 

PS Disclosure; Page 40-44; 93pp; English.
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS). The method involves administering

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates), Charcot-Marie-Tooth disease, ataxia

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Refsum's disease, Fabry's

CC disease, or lipoproteinemia. The present sequence represents a DNA

CC encoding the rat neurotransmitter receptor protein Nogo (Nogo-A, Nogo-B 

CC and Nogo-C), an example of NS-specific antigen

XX 

SQ Sequence 4684 BP; 1358 A; 1047 C; 1112 G; 1167 T; 0 U; 0 Other; 

Query Match 56.9%; Score 2179.4; DB 6; Length 4684; 

Best Local Similarity 78.2%; Pred. No. 0; 

Matches 3008; Conservative 0; Mismatches 731; Indels 110; Gaps 28; 
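
The summary figures above appear to be mutually consistent. Assuming that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + mismatches + indels), which is an inference from the numbers and not something this listing states, 3008 / (3008 + 731 + 110) comes to 78.2%. A minimal Python sketch of that check follows; the function name is hypothetical.

```python
def best_local_similarity(matches, mismatches, indels):
    """Percent identity over aligned columns.

    Assumption: the listing's 'Best Local Similarity' equals
    matches / (matches + mismatches + indels). This is inferred
    from the reported numbers, not documented in the listing.
    """
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


# Counts taken from the RESULT 13 summary lines above.
similarity = best_local_similarity(matches=3008, mismatches=731, indels=110)
print(f"{similarity:.1f}%")
```

The same formula can be applied to the counts of any other result in the listing to confirm that its reported percentage was transcribed correctly.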

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I III I I I I I 



Db 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

Qy 61 GT ACT G C C CAC T GAAG GAAC AC T T C CAG CAACT T CAAAT GAAG CT T CTAAAG CAT T CT C A 120 

II I I I I I I I I II I I I I I I I I I I I I I I II I II II I I 1 II I I II II 

Db 988 GT GT CAT C C T C AGAAG GAAC AAT T G AAGAAAC T T T AAAT GAAG C T T C T AAAGAGT T G C C A 1047 

Qy 121 GAGAAG GC AAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT TT T C AGAAT T G GAAT AT 18 0 

I I I I I I II I I I I I I I I I I I I I I I III I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1048 GAGAGGGCAACAAAT C C ATT T GT AAAT AG AG AT T TAG C AGAAT TTT C AGAAT T AGAAT AT 1107 

Qy 181 T C AGAAAT G GAAT CAT CAT T C AGT G GC T C T CAAAAGGC AGAAC CT GC C GT AAC AGTAG C G 24 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1108 T CAGAAAT GGGAT CAT CT T T T AAAGG C T C C C CAAAAGG AGAGT CAG C CAT AT T AGTAGAA 1167 

Qy 241 AAT C CT AGGGAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAG GAC T T AGT T AGT C T T 300 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II II II I I I 
Db 1168 AACACT AAG GAAGAAGT AAT T GT GAGGAGT AAAGAC A AAGAGGATT TAGTT T GT AGT 122 4 

Qy 301 AACAT C C T T CAT AC T C AGC AG GAGT T AT C T AC AGT C CT T AC GAAAT C AGT T GAAGAAGAA 360 

I I I I I I I I I I I I I I I I I III I I I I I 

Db 1225 GCAGC C CT T CAC AGT C C ACAAGAAT C ACC T GTGGGTAAAGAA 1266 

Qy 361 GATAGAGTT CT GT CT C CAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTT GCAGCA 42 0 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I III I 

Db 1267 GACAGAGTTGTGTCTCCAGAAAAGACAAT GGACATTTTTAAT GAAAT GCAGAT GTCAGTA 132 6 

Qy 421 GAAG CT T CT AT GG G G GAGGAAT AT GCAGACT T CAAACC AT T T GAGC GAGT AT GG GAAGT G 48 0 

I Ml II II I I I I II I I I I I I I II I I II I I I I I I I I I II I I I I I I I I I I 
Db 1327 GT AGC AC C T GT GAGG GAAGAGT AT GCAGACT T T AAGCC AT T T GAACAAGC AT G GGAAGT G 1386 

Qy 4 81 AAAGAT ACT T ACAAGCAAGAT AGT GAT GT T T T GAT T GCT GGAG GTAAT AT AGAGAG CAAA 54 0 

I I I I I I I I I I I II I I I I I I I I I II I I I I II I I I I I 

Db 1387 AAAGAT ACT TAT GAGG GAAGT AGGGATGTGCTG GCT GCTAGAGCT AAT 14 34 

Qy 541 TT GGAAGGTAAAGT GGATAAGAAACACT TTT CAGATAGCCTT GAACAAACAAAT CGT GAA 600 

I I I I I I I I II I I II I I III III I I I I I I I I I I I I I I I I I I I I I 
Db 1435 GT GGAAAGT AAAGT GGACAGAAAAT GCT T GGAAGAT AGC CT G GAG CAAAAAAGT CT T G GG 1494 

Qy 601 AAAGAT AGT GAAAGC AGTAAT GAT GACAC T T CAT T T C C CAGT ACAC C AGAAGCT GTAAGA 660 

II I I I I I I I I I I I I I I I I I I II I I I I II I I I I I II I I I I I I I I I I I I 

Db 14 95 AAGGAT AGT GAAGGC AGAAAT GAGGAT GCTTCTTTCCC CAGT AC C C C AGAAC CT GT GAAG 1554 

Qy 661 G GT GGT T C C G GAGC GT AC AT CAC GT GTGCTCCCTT T AAC C CAACAAC T GAGAAT GT TT C A 720 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II 
Db 1555 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACA 1614 

Qy 721 AC AAAC AT TTTTCCCTTGTT GGAAGAT CAT AC T T C G GAAAAT AAGAC AGAT GAAAAAAAG 78 0 

I I I I I I III II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1615 GC AAAC ACT T TCCCTTTGT T AGAAGAT CAT AC T T CAGAAAATAAAAC AGAT GAAAAAAAA 1674 

Qy 781 AT AGAA- AAAAAAAGG C ACAAAT T GT AAC AGAGAAGAAT G CAAGT GT CAAGAC AT C AAAC 839 

I I I I I I I II I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I 
Db 1675 AT AG AAGAAAGGAAG GC C CAAAT T AT AAC AGAGAAG A CTAGCCCCAAAACGT CAAAT 1731 

Qy 840 CCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCA 899

I I I I I I I I I I I I I I I I I I I I Ml M I I I I I I I I I I I I I I I I I I I I Ml

Db 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791

Qy 900 AAG GT GAC C GAG GAAGT AGT G G CAAAC AT G C C T GAAG GT CT AACC C CAG AT T T G GT T C AG 959 

I I I I I I I I I I I I II I II I I I I II II I I I I I I I I I I I I II II I I I II I I I | I I I 
Db 1792 AAGGT GAC T GAG G C AGC AGT GT CAAAC AT GC C T GAAG GT C T GACG C C AGAT T T AGT T CAG 1851 

Qy 960 GAAG CAT GT GAAAGT GAAT T GAAT GAAG C T AC T GGT ACAAAAAT T G C C T T T GAAACAAAA 1019 

I I I I I I I I I I I I I I I I I I I! M ! II I I I I I I I I I I I I I I I I II I I I I I I 
Db 1852 GAAGCAT GT GAAAGT GAACT GAAT GAAGC C ACAGGT ACAAAGATT GCTT AT GAAACAAAA 1911 

Qy 102 0 AT G GAC C T GGT T CAAACT T C AGAAG C T GT G CAG GAGT CACT T T AC C C T GT AAC AC AGC T T 1079 

I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I 
Db 1912 GT GGAC T T G GT C CAAAC AT C AGAAGCT AT ACAAGAAT CACT T T AC C C C AC AGC AC AG CT T 1971 

Qy 1080 TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 1139 

I I I II I I I I I I I I III I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I III 
Db 1972 T GC C CAT C ATT T GAGGAAGCT GAAGCAACT C C GT CAC CAGT T T T GCC T GAT AT T GT TAT G 2031 

Qy 1140 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 1199

I II II I I I I I I I I I I I I I I I I I I II I I I I II II I I I II I I I I I I II I I I I I 

Db 2032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2091 

Qy 12 00 T CAC CAT T AGAAACT CT T C CT T C AGT TAAT TAT GAAAGC AT AAAGTT T GAG C C T GAAAAT 1259 

II III I III I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I II M 

Db 2092 T C CC C ACT G GAAG C AC CT C CT C CAGT T AGT TAT GAC AGT ATAAAGCT T GAGC CT GAAAAC 2151 

Qy 12 60 C C CC C AC CAT AT GAG GAGG C CAT GAAT GT AT CACT AAAAAAAGAAT CAG GAAT GAAT GAA 1319 

I I I I I I II I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I 
Db 2152 C C C C CAC CAT AT GAAGAAGC CAT GAAT GT AGCAC T AAAAGCTTTGGGAACAAAGGAA 2208 

Qy 132 0 GAAAT C AC AGAGC CT GAAG GT AT T AGT GT AG CT GT T CAG GAAAC AGAAG CT C C T TAT AT A 137 9 

I M I I I II I II I I I I II III II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 2209 GGAAT AAAAGAGC CT GAAAGT T T TAAT GCAGC T GT T C AGGAAAC AGAAGC T C CT TAT AT A 22 68 

Qy 138 0 T C TAT T GC AT GT GAT T TAAT T AAAGAAACAAAGAT CT CTACT GAAC C G ACT C C AGAT T T C 1439 

II I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I 

Db 2269 T C CAT T GC GT GT GAT TTAAT TAAAGAAACAAAGCT CT C CACT GAG C C AAGT C C AGATT T C 2328 

Qy 14 4 0 T CT AGTT AT T CAGAAAT AGCAGAAGT T G CAC AGC CAGT GC C C GAG C ATT CT GAGCT AGT T 14 99 

I I M I I I I I I I I I I I I I I I I || I I M I I I I I I I I I II I I I I I I I I I I 

Db 232 9 T C TAAT TAT T CAGAAAT AG C AAAAT T C GAGAAGT C GGT GC C C GAAC AC GC T GAG C T AGT G 2388 

Qy 1500 GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GACT TAT T T AGT GAT GATT CAAT AC C C GAA 1559 

II I I I I I I I I II II I I I I I I I I II I I I I II I I I I I I I I II I I I I I I II II III 

Db 238 9 GAGGAT T C CT CAC C T GAAT CT GAAC C AGT T GACT TAT T T AGT GAT G ATT C GAT T C C T GAA 2448 

Qy 1560 GT T C C ACAAAAAC AAGAT GAAG CT GT AAT AC T T GT GAAAGAAAAC CT CACT GAAAT TT C A 1619 

I I M I I I I I I I I I I I I I I II I I I I I I I I II II I I I I I I I I I I I I I 
Db 24 4 9 GT CCCACAAACACAAGAGGAGGCT GT GAT GCT CAT GAAGGAGAGT CT CACT GAAGT GT C - 2507 

Qy 1620 T CT GAGT CAAT GAC AGGACAT G ACAATAAGG GAAAACT CAGT GCTT CAC CAT C AC CT G AG 167 9 

I II II I I I I III I I I I I I I I I I I I I II III 
Db 2508 T G AGAC AGT AG C C CAGC AC AAAGAGGAGAGAC T T AGT G C C T CAC CTCAGGAG 2559 

Qy 1680 GGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTA 1739

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 2560 CT AGGAAAG C CAT AT T T AGAGT C T T T T C AGC C CAAT T T ACAT AGT ACAAAAGAT G C T 2616 



Qy 174 0 GC AC C T GAT GAAGT T T C AGCAT T G AC C CAAAAG GAGAAAAT C C C T T T G C AGAT GGAGGAG 17 99 

I M II I I I I I I II II II II I I I I I I I I I I I li I I I I I || || | | || | | | | 
Db 2617 GC AT CT AAT GAC AT T C C AAC AT T GACCAAAAAG GAGAAAAT T T CT T T G CAAAT GGAAGAG 2676 

Qy 1800 C T C AAT ACT G C AGT T TAT T C AAGT GAT G GCT TAT T CAT T G CT CAG GAAG CAAAC C T AAGA 1859 

I M I I I I I I I II I I I I I I I I I I II I I I I I I I I I I II I I II Mil 

Db 2677 T T TAAT AC T G CAAT T TAT T CAAAT GAT GAC T T AC T T T CT T CT AAGGAAGACAAAAT AAAA 2 73 6 

Qy I860 GAAAGT GAAAC AT T T T CAGAT T C AT CT C C GAT T GAGAT TAT AGAT GAGT T C C C GAC CT T T 1919 

I M I I I II I I I I I I I II I I I I I | | || | | | | | M | | | | | MINIM II II II I I I 
Db 2 737 GAAAGT GAAAC AT T T T CAGAT T CAT C T C C GAT T GAGAT AAT AGAT GAAT T T C C C AC GT T T 2 7 96 

Qy 1920 GT C AGT T CTAAAGCAGAT TCTTCTCC T AC AT TAGC C AG GGAAT AC AC T GAC C T AGAAGT A 1979 

I I I I M MM II II II I I I I II II I I II I I III II I I I II I I I II I I I II 
Db 2797 GTCAGTGCTAA AGAT GATT CT C CT AAAT TAGCCAAGGAGTACACT GAT CT AGAAGTA 2853 

Qy 1980 GCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGA 2039

M I M I I I I I II I I II II I Mill I I II II II I I Ml 

Db 2854 T C CGACAAAAGT GAAAT T GCT AAT AT C CAAAGC G GGGC AGAT T C ATT GC C T T G CT TAGAA 2913 

Qy 204 0 T T GC C C CAT GACCT T T C T T T CAAGAGT AT ACAAC CT AAAG AGGAAGT T CAT GT C C CAGAT 2 099 

I I I I I I M I I I I II II I I I II II II I I I I I I I I I I I Mill Mill I I M I 

Db 2914 TTGCCCTGT GAC CT T T CT T T CAAGAAT AT AT AT C CTAAAGAT GAAGT AC AT GT T T CAGAT 2973 

Qy 2100 GAGT T C T C CAAAGAT AGGGGT GAT GT T T C AAAG GT G C C C GT AC T GC C T C CAGAT GT T T CT 2159 

I I I I M I I I I I I I I I MINIMI I I I I I II I || M II I I I 

Db 2974 GAAT T C T C C GAAAAT AG GT C CAGT GT AT CTAAGG C AT C CAT AT C GC C T T CAAAT GT CT C T 3033 

Qy 2160 GC T T T GGAT GCT CAAGCAGAGAT AG GC AG C AT AGAAAAAC C C AAAGT T CT T GT GAAAGAA 2219 

N N N I I II II II I I I I II I II II II I I II I II I I I N I I II I M 
Db 3034 G CT T T G GAAC CT C AGACAGAAAT GG G CAGC ATAGT TAAAT C CAAAT C AC T T AC GAAAGAA 3093 

Qy 2220 GC C GAGAGAAAACT T C C T T C T GAT ACAGAAAAAGAG C GAAGAT CT C CAT C T GCT AT AT T T 2279 

N II I I I II II I I II II I I I I I II II I I II II I I I II I I I II I I I I I 
Db 3094 GC AGAGAAAAAACTT C CT T CT GAC AC AGAGAAAGAGGAC AGAT C C CT GT C AGCT GT AT T G 3153 

Qy 2280 T CAGC AGAG C T GAGTAAAACT T C AGT T GT T GAC CT C C T CT ACT G GAGAGAC AT TAAGAAG 2339 

N N II I II I II I I II I II I I I I N I II I II II I I II I II II N II II I II II II I II I I 
Db 3154 T CAG C AGAG C T GAGT AAAACTT CAGT T GT T GAC CT C CT CT ACT GGAGAGAC AT TAAGAAG 3213 

Qy 2340 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 2399 

N I II II I I I II M I I I I I II I I I I I I II I I II I II I II II I I II I I II II I II I I 
Db 3214 ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 3273 

Qy 2400 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 2459

N I I N I I I N II I I I I I II I II I II II II I I II II I II I II II I I I II I I II I I I II 
Db 3274 GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 3333 

Qy 24 60 T AT AAG G GT GT GATC C AG GC TAT C C AGAAAT CT GAT GAAG GC C AC C CAT T C AG GG CAT AT 2519 

I N N II I I I I II II I II I I I I M I II I I II I I II I I || | | | | || | | || | I | | I I I I | 
Db 3334 T ATAAGGG C GT GAT C C AGG CT AT C CAGAAAT CAGAT GAAG GC C AC C CAT T C AGG GC AT AT 3393 

Qy 2520 T T GGAAT CT GAAGT T G C TAT AT CT GAG GAGT T G GT T C AGAAGT AC AG CAAT TCTGCTCTT 2579 

M M I I N N II I I I I II II II II I I II I II I II I II II II 

Db 3394 T T AGAAT C T GAAGT T G CT AT AT C AGAGGAAT T GGTT CAGAAAT AC AGT AAT TCTGCTCTT 3453 

Qy 2580 G GT CAT GT TAAC T GCAC AATAAAAGAACT C AGAC GC CT CT T CTT AGT T GAT GAT T T AGTT 2 639 



Db 3454 G GT CAT GT GAAC AG C ACAATAAAAGAAC T GAGG C G G C T T T T CT T AGT T GAT GAT TT AGT T 3513 

Qy 2640 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2699 

I I I I I I I I I I I I I I II I I I I I I II I I I M II I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 3514 GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 3573 

Qy 2700 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 2759

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3574 GGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAA 3633 

Qy 2760 C G G CAT CAG G C GCAAAT AGAT CAT TAT CT GGGACT T G CAAAT AAGAAT GT T AAAGAT G CT 2 819 

I I I I I I I I I I Ml I I I M I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I 
Db 3634 C GGC AT C AGGT G C AGAT AGAT CAT TAT C T AGGACT T GCAAACAAGAGT GT T AAGGAT G CC 3693 

Qy 282 0 AT GGCTAAAAT C CAAGCAAAAAT C C C T GGAT T GAAGC GT AAAGCT GAAT GAGAAAGC CT G 287 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II Ml I I I I I I 
Db 3694 ATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGATTGAAAAAGCCCC 3753 

Qy 28 80 AAAGAGT T AACAAT AGAGGAGT T TAT CT T TAAAGGGGAT AT T CAT T T GAT T C CAT T GG GG 2939 

M I II I I I I I I I I I I I I I I I I I I I III MINI I III 
Db 3754 AA ; - AC AGAAGT T CAT C T T T AAAG GGGAC AC T CACT T GAT T AC GGG 37 97 

Qy 2 94 0 AGGGT CAGGGAAGAACAAAG C CT T GACAT T GC AGT GC AGTT T C AC AGAT CT T TAT T T T T A 2 999 

II M I I I I I I I I II I II I I I I I I I II I I I I I I I 

Db 3798 GGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT — CAGCTCTTTATTTTTA 3855 

Qy 3000 GCAAC GCAGT GT CT GAG GAAAAAT GAC CT GT C T T GAC TGCCCTGTGTT CAT CAT CT T AAG 305 9 

IN M I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I M I I I I I I 
Db 3856 GCAGT GCACTGTTTGAGGAAAAATTACCTGTCTTGACTT-CCTGTGTTTAT CAT CTTAAG 3914 

Qy 3060 TAT T GT AAGC T G CT AT GT AT GGAT T TAAAT C GTAAT C AT AT TT GT T T T T C C T GT AT GAG G 3119 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3915 TAT T GTAAGCT GC T GT GT AT GGAT CT - CAT T GT AGT CAC ACTT GT C T T C C C CAAT GAG GC 3973 

Qy 3120 CACT GGT GAATAAACAAAGAT CT GAGAAAGCT GTATATT ACACTTT GT C GCAGGTAGT CT 3179 

I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I 

Db 3 974 G C C T G GT GAATAAAGGA — CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCT 4031 

Qy 3180 TGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCT G AC AGAAAT AAC C C T T T T CAC A 3236 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I 
Db 4032 AGC T GTAT G CAGAG AGTT GTAAAGAAGGCAAAT CT G G G G G CAG GGAAAAC C CT T T T CAC A 4091 

Qy 3237 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 32 96 

I Ml I M I I III I I I I I I II I I II I I II I II II II 

Db 4 092 G — T GT ACT GT GT T T G GT C AGT GT AAAAC T GAT GCAGAT T T T T CT GAAAT GAAAT GT T T A 4149 

Qy 3297 GAC GAGAT CAT GC CAC C AAGGC AG GAGT GAAAAAG CT T G C CT T T C CT GGT AT GT T C TAG G 3356 

M M I I I I I I II I I I I I I II I II || | || M I I I I II I M I I 

Db 4150 GAT GAGAGC AT ACT ACTAAAGCAGAGT GGAAAACT CT GT C — TTTATGGTGTGTTCTAGG 4207 

Qy 3357 T GTAT T GT GAAAT T TACT GT T GT AT T AAT T GC C AAT AT AAGT AAAT AT AGAT TAT AT AT A 3416 

I I M I M II I II I I I I I I I I Ml M I I I M I I I I I II I I I M II I I 

Db 4208 T GTAT T GT G - AAT T T AC T GT TAT AT T G C CAAT AT AAGT AAAT AT AGA CCTAA 4258 

Qy 3417 TCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTT 3476 

II I I I II I I M I I I I I I I I I I II I I II I I I I I I I I I I I I II M I II I I M I I I 



Db 4259 TCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTT 4318



Qy 3477 GAT ACT T C T GT CAT G GGT T T TAT GT GT GTAGT C C CAAAG C AC ATAAG C T AG GGAGAAAC G 3536 

M I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I 

Db 4319 GAC CT C T GAGT CAT T GGT TAT - G C AGT GTAGT C C AAGC AC AT AAAC TAG GAAGAGAAAT G 437 7 

Qy 3537 T AC T T C T AGG C G C AC T AC C - AT CT GT T T T CAAC AC GAAC C GAC GC CAT G CAAACAGAACT 3595 

I I I I I I M I I I I II I I II I I I I I I I I I I I I M I 

Db 4378 T ATT T GTAGGAGT GCT AC CT AC CAC CT GT TT T CAAGAAAAT AT AGAACT CCAACAAAAAT 4 437 

Qy 3596 C C T C AACAT AAACT T CAC T G C AC AGAC T T AC T GTAGT T AAT TT T AT CAC AAAC T C 3650 

M M I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 4438 AT AGAAT GT C ATT T CAAAGAC T T AC T GT AT GT ATAGT TAAT T T T GT C AC AGACT CT GAAA 4497 

Qy 3651 T G GACT GAAT C TAAT G C T T C C AAAAAT GT T T G C AAAT AT C AAAC AT T GT TAT GT A 3705 

I I I M I II I I I I I I I II I I I I I I I I I I I I I II I I I I | I | | | | | | | I | | 
Db 4 4 98 TTCTATGGACTGAATTTCATGCTTCC — AAAT GT T T G C AGT TAT C AAAC AT T GT T AT GC A 4555 

Qy 3706 AGAAAAT AT - AAAT GAC GAT TT AT ACAAT T GT GGT T T AAGCT GT AT T GAAC T AAAT CT GT 37 64 

I I I I I II II I I I I II I I I I I I I I I I I I I I I II I II I II I I I I I I I I I I I 
Db 4556 AGAAAT CATAAAAT GAAGACTT AT AC CAT T GT GGTT T AAG C C GT ACT GAAT T — ATCTGT 4613 

Qy 3765 GGAAT GC AT T GT GAAC T GT AAAAG C AAAGT AT C AAT AAAG C T TAT AGACT TAAAAAAAAA 3824 

I I M I I I I I I I I I II I II I I I I I I I I I I I I I I I I II I I || || I | | | | | | | | | | | | M 
Db 4 614 G GAAT GCAT T GT GAACT GTAAAAG CAAAGT AT C AATAAAGC T TAT AGAT C T T AAAAAAAA 4 673 

Qy 3825 AAAAAAAAA 3833 

I I I I I I I I I 
Db 4674 AAAAAAAAA 4682 

RESULT 14 
AAV30920 

ID AAV30920 standard; cDNA; 2386 BP. 
XX 



AC AAV30920; 
XX 

DT 14-SEP-1998 (first entry) 
XX 

DE Human secreted protein BG160_1 cDNA. 
XX 

KW BG160_1; secreted protein; protein factor; human; ds.
XX 

OS Homo sapiens.
XX 

FH Key Location/Qualifiers 
FT CDS 102. .2030 

FT /*tag= a 

FT sig_peptide 1863. .1899 

FT /*tag= b 

FT /note= "putative leader/signal peptide" 

FT mat_peptide 1900. .2027 

FT /*tag= c 

XX 

PN WO9817687-A2.
XX 

PD 30-APR-1998. 



XX 

PF 24-OCT-1997; 97WO-US019590.
XX

PR 25-OCT-1996; 96US-00740274.

PR 24-OCT-1997; 97US-00740274.
XX 

PA (GEMY) GENETICS INST INC.
XX 

PI Jacobs K, Mccoy JM, Lavallie ER, Racie LA, Merberg D, Treacy M; 

PI Spaulding V, Agostino MJ;

XX 

DR WPI; 1998-261426/23. 

DR P-PSDB; AAW58383. 
XX 

PT Nucleic acid encoding secreted protein from human cells - useful, e.g. as 

PT immuno-modulators, anti-tumour agents, promoters of tissue growth,

PT haemostatic and thrombolytic agents etc. 
XX 

PS Claim 20; Page 74-75; 114pp; English. 
XX 

CC This cDNA clone, designated BG160_1, codes for a novel human secreted 

CC protein (see AAW58383). It was isolated from a human adult brain cDNA

CC library using methods selective for cDNAs that encode secreted proteins. 

CC The clone is deposited in composite clone ATCC 98232; an oligonucleotide 

CC (see AAT99725) is designed to isolate the clone from the composite. The 

CC predicted AT415_4 amino acid sequence shows homology to neuroendocrine- 

CC specific proteins. Novel cDNA clones (see AAV30916-32) coding for human 

CC secreted proteins (see AAW58580-90) are claimed. These can be used for 
CC recombinant production of the secreted proteins for analysis,
CC characterisation, diagnostic or therapeutic use. They can also be used as
CC tissue or mol.wt. markers, for chromosome identification, to identify

CC genetic disorders, to isolate new related DNA, as sources of primers for 

CC PCR, to generate antibodies, and in interaction trap assays. The secreted 

CC proteins may also have many biological activities, e.g. cytokine, 

CC immunomodulator, haematopoiesis regulating activity, tissue growth 

CC activity, activin or inhibin activity, chemotactic or chemokinetic 

CC activity, haemostatic and thrombolytic activity, receptor/ligand 

CC activity, antiinflammatory, cadherin and tumour invasion suppressor 

CC activity, and tumour inhibition activity. The proteins can be expressed 

CC in vivo from DNA, introduced in gene therapy vectors 
XX 

SQ Sequence 2386 BP; 756 A; 450 C; 494 G; 686 T; 0 U; 0 Other; 
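
The base counts on an SQ header line should add up to the stated sequence length, which gives a quick check that the line survived transcription intact (756 + 450 + 494 + 686 = 2386 here). A minimal Python sketch of that check follows; the helper name `parse_sq_line` is hypothetical and the regular expressions assume the layout seen in this listing.

```python
import re


def parse_sq_line(line):
    """Return (length, counts) parsed from an 'SQ Sequence ...' header line.

    Assumes the layout used in this listing:
    'SQ Sequence <N> BP; <n> A; <n> C; <n> G; <n> T; <n> U; <n> Other;'
    """
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    counts = {
        base: int(n)
        for n, base in re.findall(r"(\d+)\s+(A|C|G|T|U|Other)\b", line)
    }
    return length, counts


sq = "SQ Sequence 2386 BP; 756 A; 450 C; 494 G; 686 T; 0 U; 0 Other;"
length, counts = parse_sq_line(sq)

# The per-base counts should account for every position in the sequence.
print(sum(counts.values()) == length)
```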






Query Match 46.4%; Score 1777.2; DB 2; Length 2386; 

Best Local Similarity 86.9%; Pred. No. 0; 

Matches 2101; Conservative 0; Mismatches 258; Indels 59; Gaps 11; 

Qy 827 CAAGACATCAAACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAAC 886

Ml I II Ml | | | || | | | | | || | | | | | 

Db 2 CAAAAC AT C AAAC CCTTTTCTT GT AG C AG C ACAGGAT T CT GAGACAGAT T AT GT C ACAAC 61 

Qy 8 87 AGAT CAT GT GT CAAAGGT GAC C G AGGAAGT AGT GGCAAAC AT GC CT GAAG GT C TAAC C C C 94 6 

INI I I I Mill I I | M Mill || M I I 

Db 62 AGAT AAT TTAACAAAG GT GAC T GAGGAAGT C GT GG CAAACAT GC C T GAAG G C C T GAC T C C 121 



Qy 947 AGATTTGGTTCAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGC 1006
M I II II II II I II II II II II I II II II II II I II | || M II II I II II 



Db 122 AG AT T T AGT AC AG GAAGC AT GT GAAAGT GAAT T GAAT GAAGTT ACT G GT ACAAAGAT T GC 181 

Qy 1007 CTTTGAAACAAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCC 1066 

I I I I M II I I I I I II II I I I I I I II II I I I I I I I I I I I I I I I II I I I II II 
Db 182 T TAT GAAACAAAAAT GGACT T GGT T CAAAC AT CAGAAGT T AT GCAAGAGT C AC T C TAT C C 241 

Qy 1067 TGTAACACAGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCC 1126

M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I Mill II I I I I I I 
Db 242 T G C AG C AC AG CT T T GC C CAT CAT T T GAAGAGT CAGAAG CT AC T C C T T C AC C AGT T T T G C C 301 

Qy 1127 T G ACATT GT CAT G GAAGCAC CAT T AAAT T C T GT AGT T C C T AGT GCTGGTGCTTCT GC AGT 1186 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I 
Db 302 T GAC AT T GT TAT GGAAGCAC CAT T GAAT T CT G CAGT T C CT AGT GCT GGTGCTTCCGT GAT 3 61 

Qy 118 7 GC AG CT CAGTT C AT CAC CAT T AGAAACT C T T C CT T C AGT T AAT TAT GAAAG C AT AAAGT T 12 4 6 

I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I i || I I || I I I I 
Db 362 AC AGC C CAG CT CAT CAC CAT T AGAAGC T T C T T C AGT T AAT TAT GAAAG C AT AAAAC A 418 

Qy 12 47 T G AGC CT GAAAAT C C C C CAC CAT AT GAGGAG GC CAT GAAT GT AT CAC TAAAAAAAGAAT C 1306 

I I I I I I I I I I I I I I I I I I I I I II || I I I II I I I I I I I I I I I I I I I I II I I I I I III 
Db 419 T GAG C CT GAAAAC C C C C CAC CAT AT GAAGAGG C CAT GAGT GT AT C ACT AAAAAAAGT AT C 47 8 

Qy 1307 AG GAAT GAAT GAAGAAAT C AC AGAG C CT GAAGGTAT T AGT GT AGCT GT T CAG GAAAC AGA 1366 

MINI II I I I I I II I I I I I I II I I I I I M I I II I I I I II I I I I I I I I I I 
Db 479 AG GAAT AAAGGAAGAAAT TAAAGAG C C T GAAAAT AT T AAT GC AGCT CT T CAAGAAAC AGA 53 8 

Qy 1367 AGCT C CT T AT AT AT CT AT T G CAT GT GAT T T AAT T AAAGAAACAAAGAT CT CT ACT GAAC C 1426 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I 
Db 539 AG CT C CT TAT AT AT C TAT T GCAT GT GAT T T AAT TAAAGAAACAAAGCT T T C T G C T GAAC C 598 

Qy 1427 GACT CCAGAT T TCT CTAGTTATT CAGAAAT AGCAGAAGTT GCACAGCCAGT GCCCGAGCA 14 86 

I I I I I I I I I I I I I I I I M I II I I I I III I I I II I I I I I II I I I I I I II II 
Db 599 AG CT C C GGAT T T CT C T GAT TAT T CAGAAAT GG CAAAAGTT GAACAG C CAGT GC CT GAT C A 658 

Qy 14 87 T T CT GAG CT AGTT GAAGAT TCCTCCCCC GAT TCT GAAC C AGT T GACT TAT T T AGT GAT GA 154 6 

I I I I I I I I I I I I I I I I II I II I I I II I I I I I I I M I I I I I I I I I I I I I || I I II I I I I 
Db 659 TT C T GAGCT AGTT GAAGAT T CCT CAC C T GAT TCT GAAC C AGT T GACT TAT T T AGT GAT GA 718 

Qy 1547 T T CAAT AC C C GAAGT T C C ACAAAAACAAGAT GAAG C T GTAAT ACT T GT GAAAGAAAAC C T 1606 

I I I M I I I I II I I II II I I I I I I I I I M I I I I I I I I M II I I I I I I I I I I I II 
Db 719 T T CAAT AC CT GAC GT T C CACAAAAACAAGAT GAAACT GT GAT GC T T GT GAAAGAAAGT C T 77 8 

Qy 1607 CACT GAAAT TT CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT C AGT GCT T C 1666 

I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 779 CAC T GAG AC T T CAT T T GAGT CAAT G AT AGAAT AT G AAAAT AAG GAAAAAC T CAGT G C T T T 838 

Qy 1667 AC CAT CAC C T GAGGGAG GAAAAC C GT AT T T G GAGT C T T T T CAG C C CAGT T TAG GCAT CAC 1726 

I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I III I I I I I II I I III 
Db 839 GC CAC CT GAGG GAGGAAAGC C AT AT T T G GAAT C T T T TAAG CT CAGT T T AG AT AAC AC 8 95 

Qy 1727 AAAAGAT AC CT T AGC AC CT GAT GAAGT T T CAG CAT T GAC C CAAAAGGAGAAAAT C C CT T T 178 6 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 896 AAAAGAT AC C C T GT T AC CT GAT GAAGT T T C AAC AT T GAG CAAAAAG GAGAAAAT T C CT T T 955 

Qy 17 87 GC AGAT GGAGGAGCT CAAT ACT GC AGT T TAT T CAAGT GAT GG C T TAT T CAT T G C T CAG G A 1846 

I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I 
Db 956 G C AGAT G GAG GAGCT CAGT AC T GC AGT T TAT T CAAAT GAT GACT TAT T TAT T T CTAAG GA 1015 



Qy 1847 AG CAAAC C T AAGAGAAAGT GAAACAT T T T C AGAT T CAT CT C C GAT T GAGAT T AT AGAT GA 1906 

I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I II I II II I I I I I I I II I I 
Db 1016 AGCAC AGAT AAGAGAAAC T GAAAC GT T T T C AGAT T CAT C T C CAAT T GAAAT TAT AGAT GA 1075 

Qy 1907 GT T C C C G AC C T T T GT C AGT T CT AAAG C AGAT T CT T CT C CT AC AT TAG C C AG G GAAT AC AC 1966 

I I I I II II II II I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I II 
Db 1076 GT T C C CT AC AT T GAT C AGT T C T AAAACT GAT T CAT T T T CTAAAT TAG C C AGG GAAT AT AC 1135 

Qy 1967 T GAC CT AGAAGT AGC C CACAAAAGT GAAAT T G CT GAC AT C C AGGAT GGAG CT GGGT C AT T 2026 

I I I I I I II I I I I I I I I I I I II I I I II I I I II I I I II I I I I II I I I I I I I I I I I I 
Db 1136 T GAC C T AGAAGT AT C C CACAAAAGT GAAAT T GCT AAT G C C C C G GAT G GAGCT G G GT CAT T 1195 

Qy 2027 GG C T T GT GC AG GAT T GC C C CAT GAC CTTTCTTT CAAGAGT AT ACAAC CTAAAGAG GAAGT 2086 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1196 G C CT T GC AC AGAAT T GC C C CAT GAC CT T T C TT T GAAGAAC AT AC AAC C CAAAGT T GAAGA 1255 

Qy 2087 TCA T GT C C C AGAT GAGT T CT C C AAAGAT AGG G GT GAT GT T T CAAAGGT GC C C GT 214 0 

I I I I I I I I I I I I I I I I I I I I I I III I I I II I I I I I I I 

Db 1256 GAAAAT C AGT TT C T C AGAT GAC T T T T CT AAAAAT GGGT CT GC T AC AT CAAAGGT G CT C T T 1315 

Qy 2141 AC T G C CT C C AGAT GTTTCTGCTTTG GAT GCT CAAG C AG AGAT AGGC AGC AT AGAAAAAC C 2200 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1316 AT T GC CT C C AGAT GT TT CT G CT T T GGC CAC T CAAGCAGAGAT AGAGAG CAT AGT TAAAC C 1375 

Qy 2201 CAAAGTT CT T GT GAAAGAAGC C G AGAGAAAAC T T C C T T CT GAT AC AGAAAAAGAGC GAAG 22 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I II I I I I II I II 
Db 1376 CAAAGTT C T T GT GAAAGAAG CT GAGAAAAAACT T C CT T C C GAT AC AGAAAAAGAG GACAG 1435 

Qy 22 61 ATCTCCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTA 232 0 

III I II I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I II I I I I I I 
Db 1436 AT CAC CAT C T GCT AT AT T T T C AG CAGAG CT GAGT AAAACT T C AGT T GTT GAC C T C C T GT A 1495 

Qy 2321 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTC 2380 

I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I 1 I 
Db 1496 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTC 1555 

Qy 23 81 GCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGT 244 0 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1556 AT T GACAGT AT T CAG CAT T GT GAG C GTAACAGC C T AC AT TGCCTTGGCCCT GCT C T C T GT 1615 

Qy 24 41 GAC TAT C AG CT T TAG G AT AT ATAAGG GT GT GAT C CAG GC TAT C C AGAAAT CT GAT GAAG G 250 0 

Ml I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1616 GAC CAT CAGCTTT AGGAT AT ACAAGGGT GT GATC CAAGCT AT CCAGAAAT CAGAT GAAGG 1675 

Qy 2501 CCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAA 2560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 1676 C CAC C CAT T CAGG GAAGTT G C TAT AT C T GAGGAGT T GGT T CAGAA 1720 

Qy 2561 GT AC AG CAAT TCTGCTCTTGGT CAT GT T AAC T GCACAAT AAAAGAACT C AGAC GC C T CT T 2620 

I I I I I I I I I I I I I I I II I I I I I 1 I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1721 GTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTT 17 8 0 

Qy 2621 CT TAGT T GAT GAT T T AGTT GAT T CT C T GAAGT TT G C AGT GT T GAT GT GG GT AT T T AC C T A 268 0 

I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1781 C T TAGT T GAT GAT T T AGTT GAT T CT C T GAAGT T T GC AGT GT T GAT GT G GGT AT T T AC C T A 1840 



Qy 2681 TGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAG 27 4 0 

N I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | || | | | | 
Db 1841 TGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAG 190 0 

Qy 2741 TGTTCCTGT TAT T TAT GAAC G GC AT CAGG C GCAAAT AGAT CAT TAT CT G G G AC T T GC AAA 2800 

M I I I I I I I I I I I I I I I I I I I I I I I II I M II I I I I I I I I I I I I I I I I II I I M I I I 
Db 1901 T GT T C C T GT T AT T TAT GAAC G GC AT CAG GC AC AGAT AGAT CAT TAT C TAG GACT T G C AAA 1960 

Qy 2801 T AAGAAT GT TAAAGAT G C TAT G G CTAAAAT C CAAG C AAAAAT C C C T GGAT T GAAG C GT AA 2860 

I I I I I I I M I I I II I I I I II I | | | | | | | | | | || | | | | | | | | || | | | | | | | | | | || | | | | 
Db 1961 T AAGAAT GT TAAAGAT G CT AT G G C T AAAAT C CAAG C AAAAAT C C C T G GAT T GAAG C G C AA 2020 

Qy 2 861 AGCT GAAT GAGAAAGCCT GAAAGAGTTAACAATAGAGGAGTTTAT CTTTAAAGGGGATAT 292 0 

I I I I I I I I M I I I I I I I I I I I I | | | || | | | | | | || || | | | | || | | | 

Db 2021 AGCT GAAT GAAAAC GC C CAAAAT AATTA GTAGGAGTT CAT CTTTAAAGGGGATAT 2075 

Qy 2921 T CAT T T GAT T C CAT T G GGGAG G GT C AGGGAAGAAC AAAG C C T T GAC AT T G C AGT GCAGTT 2 980 

I N II I I I I I I I I I I I I | | | | | | | | | | | || | || | M I I II I I I I I I I I I I I I I 
Db 2076 T C AT TT GAT TAT AC G G G G GAGG GT CAGGGAAGAAC GAA- C C T T GAC GT T GC AGT GCAGTT 2134 

Qy 2981 TCAC AGAT CT T TAT T T T TAG CAAC GC AGT G - T C T GAGGAAAAAT GAC CT 3028 

I I I I I I I I II II I I I I I I I I I I Ml II I I I I I I I I I I || | | | | 

Db 2135 T CAC AGAT C GT T GT T AGAT CT T TAT T TT T AGC C AT GC ACT GT T GT GAGGAAAAAT T AC CT 2194 

Qy 3029 GTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAA 308 8 

I N I I I I I I I I I I I I I I | | | || | | | | | | M I I I I I II I I I I I I I I | || | | | | | || | | | | 
Db 2195 GT C T T GAC T GCC AT GT GT T CAT CAT CT TAAGT AT T GTAAG C T GCT AT GT AT GGAT T T AAA 2254 

Qy 308 9 T C GT AAT CAT AT TTGTTTTTCCT GT AT GAGGC ACT GGT GAAT AAACAAAGAT C T GAGAAA 314 8 

I I I I I I I I I I I I M I I I I I I I I I M || I I I I | | | I I I I I 

Db 2255 C C GT AAT CAT A — TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAA 2301 

Qy 314 9 GCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAG 32 07 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I 

Db 2302 C CT GTAT ATT T T ACTT T GT T G C AGAT AGT CT T GC C G C AT CT T G GCAAGT T G C AGAGAT GG 2361 

Qy 3208 T GGAGCT GACAGAAATAA 3225 

I I I I I I I I III II 
Db 2362 T GGAGCT AGAAAAAAAAA 2379 



RESULT 15 
AAF98399 
ID AAF98399 standard; cDNA; 2386 BP. 
XX 

AC AAF98399; 
XX 

DT 07-JUN-2001 (first entry) 
XX 

DE Human cDNA clone BG160 1 sequence SEQ ID 41. 
XX 

KW Human; secreted protein; nutrient; cytokine modulator; proliferation; 

KW differentiation; immune system modulator; tissue growth; chemotactic; 

KW haemostatic; thrombolytic; anti-inflammatory; tumour inhibition; ss; 

KW haematopoiesis. 
XX 

OS Homo sapiens. 
XX 

PN WO200119988-A1. 
XX 

PD 22-MAR-2001. 
XX 

PF 14-SEP-2000; 2000WO-US025135. 
XX 

PR 17-SEP-1999; 99US-00398829. 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Jacobs K, Mccoy JM, Lavallie ER, Collins-Racie LA, Evans C; 

PI Merberg D, Treacy M, Bowman MR, Spaulding V, Agostino MJ; 
XX 

DR WPI; 2001-244801/25. 

DR P-PSDB; AAB90682. 
XX 

PT Isolated nucleic acids encoding polypeptides, useful for modulating e.g. 

PT cytokine and cell proliferation/differentiation activity, the immune 

PT system and hematopoiesis regulating activity. 
XX 

PS Claim 1; Page 408-409; 557pp; English. 
XX 

CC Human cDNA clones represented in AAF98374 - AAF98489 encode secreted 

CC proteins AAB90667 - AAB90750. The cDNA clones are isolated from various 

CC tissue types, and may be used in the prevention, treatment and diagnosis 

CC of diseases associated with inappropriate protein expression. The 

CC polypeptides and nucleic acids may be used as nutrients or to modulate 

CC cytokine and cell proliferation/differentiation activity and may also be 

CC involved in modulation of the immune system. The cDNA sequences, 

CC proteins, their agonists and/or antagonists exhibit haematopoiesis 

CC regulating activity; tissue growth activity; activin/inhibin activity; 

CC chemotactic/chemokinetic activity; haemostatic and thrombolytic activity; 

CC receptor/ligand activity; anti-inflammatory activity; haematopoiesis 

CC activity; cadherin/tumour suppressor activity; and/or tumour inhibition 

CC activity. Included in the invention are probes represented in AAF98490 - 

CC AAF98572 which are specific for the cDNA clones encoding the secreted 

CC proteins 

XX 

SQ Sequence 2386 BP; 756 A; 448 C; 496 G; 686 T; 0 U; 0 Other; 



Query Match 46.3%; Score 1774; DB 5; Length 2386; 

Best Local Similarity 86.8%; Pred. No. 0; 

Matches 2099; Conservative 0; Mismatches 260; Indels 59; Gaps 11; 
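
Note: the summary figures above appear to be mutually consistent if "Best Local Similarity" is taken as Matches divided by the total number of aligned columns (Matches + Mismatches + Indels). This relationship is inferred from the numbers in this listing, not from documentation of the search program. A minimal sketch of the check, with a hypothetical helper name:

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent identity over all aligned columns (assumed definition)."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


# Figures from the AAF98399 summary lines in this listing.
similarity = best_local_similarity(matches=2099, mismatches=260, indels=59)
print(round(similarity, 1))

# The per-base counts on the SQ line should add up to the stated length.
base_counts = {"A": 756, "C": 448, "G": 496, "T": 686}
print(sum(base_counts.values()))
```

Under that assumption the first value rounds to the 86.8% reported above, and the base counts sum to the stated 2386 BP.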

Qy 827 C AAGAC AT CAAAC CC T T T C CT T AT GG CAG C AC AGGAGT CTAAGACAGAT T AC GT T ACAAC 886 

HI I I I I I 1 I I I I I I I I III I I I | | || | | | | | | | | | | | | M I I I I II I I I II 
Db 2 CAAAACAT CAAAC CCTTTTCTT GT AG CAG CAC AG GATT CT GAGACAGAT TAT GT C AC AAC 61 

Qy 887 AGAT CAT GT GT CAAAGGT GAC C GAG GAAGTAGT GGCAAACAT G C C T GAAGGT C TAAC C C C 946 

I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 62 AGATAATTTAACAAAGGTGACTGAGGAAGTCGT GGCAAACAT GCCTGAAGGCCTGACTCC 121 



Qy 947 AGAT TT GGTT CAG GAAG CAT GT GAAAGT GAATTGAATGAAGCTACTGGTACAAAAATT GC 1006 

I M I I I II I II I I I I I I I I I I I I I I I | | | | | || | | | | | | | | | | M II I I I I I I I II 
Db 122 AGAT TTAGTACAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGTTACT GGTACAAAGATT GC 181 



Qy 1007 C T T T GAAACAAAAAT G GAC CT G GTT CAAACT T CAGAAG C T GT GC AGGAGT CAC T T T AC C C 1066 

f 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 I 1 1 1 1 1 1 1 I I I M I i i i ii i i 
1 t i ii i i ii i ii i i ii i I I I I I I I I I I l l l l l l 1 1 I 1 II 1 1 | | || | | || || 
Db 182 TTAT GAAACAAAAAT GGAC T T G GTT CAAAC AT CAGAAGT T AT G CAAGAGT CACT CTAT C C 241 

Qy 1067 TGTAACACAGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCC 1126 

II 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 Mill I 1 1 I I I I I 
11 1 i i i i i i i i i i i l l l l l l l l l l l l l M 1 II 1 i 1 1 1 1 1 I Mill 1 M 1 I 1 I 1 
Db 242 T G C AG C ACAG CT T T G C C CAT CAT T T GAAGAGT CAGAAGC T ACT C CT T CAC CAGT T T T GC C 301 

Qy 1127 TGACATTGTCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGT 1186 

1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l I I I i i i i i i i i i i i i 
1 1 1 ■ II 1 1 ! 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I I I | | | { | 
Db 302 TGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGAT 361 

Qy 1187 G CAG C T CAGT T CAT CAC CAT T AGAAACT C T T C C T T C AGT T AAT TAT GAAAGC ATAAAGT T 1246 

1 1 1 1 III 1 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 l l l l l l l I I i i i i i i i i i i i i i i i 
1111 1 M 1 1 1 1 1 M 1 1 II 1 1 II 1 1 1 1 1 1 II || | | | | | | | | | || | | | | | | 
Db 362 ACAG C C CAG C T CAT CAC CAT T AG AAG C T T CT T C AGT T AAT TAT GAAAGC AT AAAAC A 418 

Qy 1247 T GAG C CT GAAAAT C C C C CAC CAT AT GAG GAGGC CAT GAAT GT AT CAC T AAAAAAAGAAT C 1306 

| | | I I I 1 I I 1 | | 1 l l l l I 1 I I I I I I I I I I I I I I I i i i i i i i i i i i i i i i i 
I 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 II 1 1 1 I 1 1 1 I 1 1 1 1 1 1 II 1 Ml 
Db 419 T GAG C CT GAAAAC C C C C CAC CAT AT GAAGAG GC CAT GAGT GT AT C ACT AAAAAAAGT AT C 478 

Qy 1307 AGGAAT GAAT GAAGAAAT C AC AGAG C CT GAAGGT AT TAGT GT AGC T GT T C AGGAAAC AGA 1366 

1 1 1 1 If II 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 II f 1 l l l l i i i i i i i i i i 
l l 1 1 1 1 II If 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M I | | | 1 | | || | | | | | 
Db 479 AG GAATAAAGGAAGAAAT TAAAGAG C CT GAAAAT AT TAAT GC AGC T C T T CAAGAAAC AGA 538 

Qy 1367 AGCT C CT T AT ATAT C TAT T G CAT GT GAT T T AAT TAAAGAAACAAAGAT CT CT ACT GAAC C 1426 

I | | 1 | | | 1 | | | 1 1 I I I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ill ii 
1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 M 1 1 1 1 1 1 II 1 II M M M 1 1 III 1 1 1 1 1 1 1 
Db 539 AGCT C CT TAT AT AT C TAT T G CAT GT GATT T AAT TAAAGAAACAAAGC T T T CT GC T GAAC C 598 

Qy 1427 GACT C C AGAT T T C T C T AGTT AT T C AGAAAT AGC AGAAGT T G CAC AG C CAGT G C C C GAG C A 1486 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I E 1 1 III 1 1 1 1 1 I 1 ) 1 1 I I t i i i i i ii ii 
MM I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 Ml 1 1 1 1 M 1 1 1 1 1 1 I 1 1 i 1 1 || || 
Db 599 AGC T C C GGAT T T CT C T GATT AT T CAGAAAT G GCAAAAGTT GAACAG C CAGT G C CT GAT C A 658 

Qy 1487 T T CT GAGC T AGT T GAAGATT CC T C C C C C GAT T CT GAAC CAGT T GAC T TAT T TAGT GAT GA 1546 

1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 it i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
M M M 1 M M 1 1 1 II M 1 1 1 1 M II | | | M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | 
Db 659 T T CT GAGC TAGT T GAAGAT T CC T C AC CT GAT T CT GAAC CAGT T GAC T T ATT TAGT GAT GA 718 

Qy 1547 T T CAAT AC C C GAAGT T C C ACAAAAACAAG AT GAAGC T GTAAT ACT T GT GAAAGAAAAC C T 1606 

1 1 1 M 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 Mil || M 1 1 1 1 1 1 1 1 l l l ii 
M M II 1 1 1 II 1 If 1 If 1 1 1 1 I I M | 1 1 1 M 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 II 
Db 719 T T CAAT AC CT GAC GT T C C ACAAAAAC AAGAT GAAACT GT GAT GCT T GT GAAAGAAAGT C T 778 

Qy 1607 CAC T GAAAT T T C AT CT GAGT CAAT GAC AG GAC AT GACAAT AAGGGAAAAC T CAGT GCT T C 1666 

1 M If 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 1 M 1 1 1 II 1 1 1 
111111 f Mill 1 1 I II 1 1 1 1 1 1 II f 1 1 1 1 I |[ M II 1 1 1 1 1 1 | 1 | | | | | | 
Db 779 CACT GAG ACTT CATTT GAGT CAAT GAT AGAATAT GAAAAT AAGGAAAAACT CAGT GCT T T 838 

Qy 1667 AC CAT CAC C T GAG GGAGGAAAAC C GT AT T T G GAGT CT T T T C AGC C C AGT TT AG G CAT CAC 1726 

1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II I I I I I I M I I I i i i i ii ii i i ii i i i iii 
i i i I I l l l l l l l l 1 1 1 1 || 1 I 1 II I II 1 If 1 M III II 1 1 1 1 1 1 1 III 
Db 839 GC CAC CT GAGG GAGGAAAGC C AT AT T T G GAAT CT T T TAAG CT C AGT TT AGATAAC AC 895 

Qy 1727 AAAAGAT AC CT T AG CAC CT GAT GAAGT T T CAGCAT T GAC C CAAAAG GAGAAAAT C C C T T T 1786 

M M 1 1 1 1 M 1 II 1 1 1 1 1 1 1 I I I M I I I 1 I I I i i i i ii i i i i i i i m i i i i i 
1111111111 ■ II 1 1 1 1 1 1 f M I M 1 1 1 1 1 1 M 1 II I 1 I 1 I I I I I I I 1 1 1 1 1 
Db 896 AAAAGAT AC C CT GT T AC C T GAT GAAGT T T CAAC AT T GAGCAAAAAG GAGAAAAT T C C T T T 955 

Qy 1787 G C AGAT GGAG GAG CT CAAT AC T GC AGT T TAT T CAAGT GAT G G CT T AT T CAT T G C T CAG GA 1846 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 III II MM 
Db 956 GC AGAT GGAGG AG C T CAG TACT G C AGT T TAT T CAAAT GAT GACT TAT T T AT TT C TAAG GA 1015 

Qy 1847 AG CAAAC C T AAG AGAAAGT GAAAC AT T T T CAG AT T CAT C T C C GAT T GAGAT TAT AGAT GA 1906 

Db 1016 AG C AC AGAT AAGAGAAAC T GAAAC GT T T T C AGAT T CAT C T C CAAT T GAAAT T AT AGAT GA 1075 

Qy 1907 GT T C C C GAC CT T T GT CAGT T CTAAAG C AGAT TCTTCTCC T ACAT TAG C C AG G GAATACAC 1966 

II I M M I I I I I ! I M I I I | | M I I I I I I I | M 
Db 1076 GT T C C C T AC AT T GAT CAGT T C T AAAAC T GAT T CAT T T T CT AAAT TAGC C AG G GAATAT AC 1135 

Qy 1967 T GAC CT AGAAGT AG C C CACAAAAGT GAAAT T G CT GAC AT C C AGGAT GGAGCT G GGT CAT T 2026 

I I I I I I M I I I I I I I I I I I I I I I II I II I I I I I I I I I I I II | | | | | 
Db 1136 TGACCTAGAAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATT 1195 

Qy 2027 GGC T T GT GC AG GAT T G C C C CAT GAC CT T T C T T T CAAGAGT AT AC AAC C TAAAGAGGAAGT 2086 

I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | M | | | || | | | | 
Db 1196 GC C T T GC AC AGAAT T G C C C CAT GAC CT T T C T T T GAAGAAC AT ACAAC C CAAAGT T GAAG A 1255 

Qy 2087 TCA T GT C C C AGAT GAGT T C T C CAAAGAT AGGG GT GAT GT T T C AAAGGT GC C C GT 2140 

I I M I I I I M I I I I I I II I I I I III II I I I I I I I I | | 
Db 1256 GAAAAT C AGT T T CT C AGAT GACT T T T CT AAAAAT GG GT C T G C T AC AT CAAAG GT G C T CT T 1315 

Qy 2141 ACT G C CT C C AGAT GTTTCTGCTTTG GAT G C T CAAGCAGAGAT AGG C AGC AT AGAAAAAC C 2200 

I M I I I I I I I I I I I I I I I I I I I I I I || || | | || | | | | | | | | | | | | I I I I I I I 
Db 1316 AT T G C CT C C AGAT GTTTCTGCTTTGGC C AC T CAAGCAGAGAT AGAGAGC AT AGT T AAAC C 1375 

Qy 2201 CAAAGTT CTT GT GAAAGAAGCC GAGAGAAAACTT C CTT CT GAT ACAGAAAAAGAGC GAAG 2260 

I I I M I I M I I I I | | | I I I I | | | I I I I I I I I I I I I I || | M | | || 
Db 1376 CAAAGT T CT T GT GAAAGAAGCT GAGAAAAAACT T C CTT C C GATAC AGAAAAAGAGGAC AG 1435 

Qy 2261 ATCTCCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTA 2320 

Ml M I I I I I I II I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | || | || | | | | | | || 
Db 1436 AT C AC CAT CT G CTAT AT T T T CAG C AGAGC T GAGT AAAAC T T CAGT T GT T GAC C T CCT GT A 1495 

Qy 2321 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTC 2380 

I I I M I I I I I I I I I I I I I | | | | | | M | | | | | | | | | | | | | | | || | | | | | || | | | | | || 
Db 1496 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTC 1555 

Qy 2381 GCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGT 2440 

I I I I I I I I I I I I I I I I I I I I M I I II I I II I I I I I I I I I I || | | | | M | | | | | | | | 
Db 1556 ATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGT 1615 

Qy 2441 GACT AT C AG CT T TAG GATAT AT AAG G GT GT GAT C C AG GC T AT C C AGAAAT C T GAT GAAGG 2500 

I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I 
Db 1616 GAC CAT C AGCT T TAG GAT AT AC AAG G GT GT GAT C CAAGC T AT C C AGAAAT C AGAT GAAG G 1675 

Qy 2501 C C AC C CAT T CAG G G CAT AT T T G GAAT CT GAAGT T GCT AT AT CT GAG GAGT T G GT T C AGAA 2560 

I I I I I I I I I I I I I | | || | | | | | | | | | | | | | | | | | | | || | | | | | | | 
Db 1676 C C AC C CAT T CAG G GAAGT T GCT AT ATCT GAG GAGTT GGT T C AGAA 1720 

Qy 2561 GT AC AG CAAT T C T GCT C T T G GT CAT GT TAACT GC ACAATAAAAGAAC T C AGAC GC C T CT T 2620 

I I I I I I I I I I I II I I I M I M I I I I M I I I I I I I I I I III 
Db 1721 GTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTT 1780 

Qy 2621 CTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTA 2680 

M I I I I I I II I I I I I I I I II I I I I I I 
Db 1781 C T T AGT T GAT GAT T T AGT T GAT T CT C T GAAGT T T G CAGT GT T GAT GT GGGT AT T T AC C T A 1840 

Qy 2681 T GTT GGT GCCTTGTT CAAT GGT CTGACACTACTAATTTTGGCTCTGATTTCACT CTT CAG 2740 
I I M I I I I I I I I I I I I I I I I I I I I | | | | | | | | I I I I I I I | | | | I I M I I I I I I II I I 



Db 1841 TGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAG 1900 

Qy 2741 TGTTCCTGT TAT T TAT G AAC G G CAT C AG G C G C AAAT AG AT CAT TAT C T G G G AC T T G C AAA 2800 

1Qni 1111 Nil II II || Mill II II I II || || III II II M Ml M | II III II II I 
Db 1901 TGTTGGTGT TAT T TAT GAAC G GC AT C AGGC ACAGAT AGAT CAT TAT C TAG GACT T GCAAA 1960 

Qy 2801 TAAGAAT GT TAAAGAT G C TAT GG C T AAAAT C CAAGC AAAAAT C C CT G GAT T GAAGC GT AA 2860 

I I II I I II I I II I I II I I I II II I I I || I I I I I II I I I I I II I I | | II II I I II I I I II 
Db 1961 TAAGAAT GT TAAAGAT GCT AT GGC TAAAAT C CAAG CAAAAAT C C C T G GAT T GAAG C G CAA 2020 

Qy 2861 AG C T GAAT GAGAAAGC CT GAAAGAGT TAACAAT AGAG GAGT T TAT C T T T AAAG GG GAT AT 2920 

onoi II Ml Mill II III | MM II I MIMMMMIIMM 

Db 2021 AGC T GAAT GAAAAC GC C CAAAAT AAT T A GT AGGAGT T CAT CT T TAAAGG G GAT AT 2075 



Qy 2921 TCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTT 2980 

"'""HI I M II I II II I I II I I I I I M II M II I II I II II M II II II 
Db 2076 T C ATT T GAT TAT AC GG GG GAGG GT C AG GGAAGAACGAA- C C T T GAC GT T GC AGT G CAGT T 2134 

Qy 2981 TCAC AGAT CT T T AT TT T TAG C AAC G CAGT G- T CT GAGGAAAAAT GAC C T 3028 

I I " M II I I I I I I I M II I I I II I II I I II II M I I M MM 
Db 2135 T CAC AGAT C GT T GT T AGAT C TT T AT TT T TAG C CAT G CACT GT T GT GAGGAAAAAT TAC C T 2194 

Qy 3029 GT CT T GAC T GC C C T GT GT TC AT C AT CT TAAGT ATT GTAAG CT G CT AT GT AT GGAT T T AAA 3088 

'"""""I " M I M I II II I II II II || || II II II I I II II I M I I I II I I I I 
Db 2195 GT C T T GACT GC CAT GT GT T CAT CAT C T T AAGTAT T GTAAGCT GC T AT GT AT GGAT T TAAA 2254 

Qy 3089 T C GTAAT CAT AT T T GT TT TT C C T GT AT GAGGCAC T G GT GAATAAACAAAGAT C T GAGAAA 3148 

"""III I I M I I I II I I I II I II I I M II I Mill 
Db 2255 CCGTAATCATA-- TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAA 2301 

Qy 3149 GCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAG 3207 

"""I" M I I M I II II II || || M I I I I M II I I || II II I I 
Db 2302 C CTGT AT AT T T T ACT T T GT T GC AGAT AGT CT T G C C G CAT C T T G GCAAGT T GC AGAGAT GG 2361 



Qy 3208 T G GAG C T G AC AGAAAT AA 3225 

I I I I II I I III II 
Db 2362 T G GAGCT AGAAAAAAAAA 2379 



Search completed: September 11, 2004, 02:15:15 
ALIGNMENTS 



RESULT 1 

US-09-484-970B-106 

; Sequence 106, Application US/09484970B 
; Patent No. 6426186 1 
; GENERAL INFORMATION: 
; APPLICANT: Jones, Karen A. 
; APPLICANT: Volkmuth, Wayne 
; APPLICANT: Walker, Michael G. 
; TITLE OF INVENTION: BONE REMODELING GENES 
; FILE REFERENCE: PB-0014 US 
; CURRENT APPLICATION NUMBER: US/09/484,970B 
; CURRENT FILING DATE: 2000-01-18 
; NUMBER OF SEQ ID NOS: 172 
; SOFTWARE: PERL Program 
; SEQ ID NO 106 
; LENGTH: 4822 
; TYPE: DNA 
; ORGANISM: Homo sapiens 
; FEATURE: 
; NAME/KEY: misc_feature 
; OTHER INFORMATION: Incyte ID No. 6426186 444857. 15CB1 
; NAME/KEY: unsure 
; LOCATION: 33, 51, 79, 211, 369, 483-484, 731, 748, 4803, 4805-4806, 4808-4809, 
; OTHER INFORMATION: a, t, c, g, or other 
US-09-484-970B-106 

Query Match 70.1%; Score 2687.8; DB 4; Length 4822; 

Best Local Similarity 85.6%; Pred. No. 0; 

Matches 3333; Conservative 0; Mismatches 458; Indels 104; Gaps 27; 



Qy 


1 


^ A ^^-lli li L/\iaLLbL.Hjbl 1 1 iAAAGAACGTGAATACCTTGGTGATTTACCAGCA 

II 1 1 1 M 1 1 1 1 1 1 MM M M 1 M 1 M 1 1 1 M M 1 M MM M M 

CT GTCT CCT CTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGT CAACA 


60 


Db 


964 


1023 


Qy 


61 


GTAPTfiPPP APTnA APPATAPTAPTTr^r* A r*r*7\ 7\ r*rnrnr~' 7\ n 7\ m/- 7\ 7\ ^^mmr-*mn -* ^ „ _ „„ 

u±s\^± ijlll/\li bAAbCjAALALi 1 CLAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 

HI 1 1 1 1 M M M 1 M M M M 1 M M 1 1 M M M M 1 M M 1 M M 1 

GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 


120 


Db 


1024 


1083 


Qy 


121 


APr A APiPtP AAAAAATPP7iTTTf"'TRr*7\r , 7\r ,, 7\ 7\ 7\ mmm 7i /-i t\ r*> Ti -n mm mm ^h^-r t. mm ^. — ^. 

^mtnj^/wwv\i llai i L\3i AbAbACjAAA I 1 1 AAC AGAAT T TT C AGAAT T GGAAT AT 

1 1 1 1 1 1 N II II 1 M 1 1 MM 1 1 M M M M M M M M 1 M M 1 

GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 


180 


Db 


1084 


1143 


Qy 


181 


TP Af^A A ATP.P*A ATP Z\TP ZiTTP a r r rr , / r *r ,f pp r pr , 7\ 7\ a t\ ^ ^ -a t\ r .^ m ^, / i r ,« m ~ ■* « rtm .» „ « „ 
x L/ii LAI 1 UAb 1 CjCjL I L I CAAAAGGCAGAACCTGCCGTAACAGTAGCG 

1 1 1 1 1 1 1 1 1 1 MUM M M 1 M M M 1 III 1 1 1 II 1 1 1 1 1 II 1 1 1 INI 

TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 


240 


Db 


1144 


1203 


Qy 


241 


AATCPT AC^f^PrAPPrA A AT AP.TTPTP Z\rrjRr r T 1 7\r , 7\r Am7\ 7\ T\r*T\ l\ r*~K r*T\ /-.mm tv ^mm-A ^.m rtmm 

jrLrv - L J.^^^^j^^'j^vAM.i/ikji Hal bAbbAbl AGAGAl AAAGAAGAGGACTTAGTTAGTCTT 

1 1 M M 1 II 1 1 II 1 1 II 1 III || || | | M II 1 1 1 1 M 1 1 M 1 

AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 


300 


Db 


1204 


1263 


Qy 


301 


AACATCPTTPAT APTPAPP acr,zic r rTa r Pn r PnrB( r, Tpr , mrTi7\ r*r>7\ i\ -n m/-* ~ mrr1 ~ _ _ __ _ 
j-LTt^rt j. x x ^ ^ J- j-v^ J- L/\^o/\ijijAu 1 1 A I u 1 ALAb 1 Lb 1 I AC GAAAT C AGT T GAAGAAGAA 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | | | 1 1 1 1 1 | | | | | | | 1 | | | | | | | | 

AAC AT C CT T CAT AAT CAAC AAGAGT TAC CT AC AG C T CT T AC T AAAT T GGTT AAAGAGGAT 


360 


Db 


1264 


1323 


Qy 


361 


GAT AGAGT T CT GT CT C C AGAAAAAACAAAG GACAGT TT TAAGGAAAAG G GAGT T GC AGC A 

' 1 1 M 1 1 II 1 1 INN II 1 || | | | | | | | | 1 1 

AGT T GT GT C T T C AGAAAAAGCAAAAG ACAGT T T TAAT GAAAAGAG AGT T GC AGT G 


420 


Db 


1324 


1380 


Qy 


421 


GAAG C TT C TAT G GG GGAGGAAT AT GCAG AC T T CAAAC CAT T T GAGC GAGT AT G G GAAGT G 

N 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | || | | || | | || | | | | | | | | | | 

GAAG CT C CT AT GAGGGAGGAAT AT GCAGACT T CAAAC CAT T T GAGC GAGT AT GGG AAGT G 


480 


Db 


1381 


1440 


Qy 


481 


AAAGAT ACT T ACAAG CAAGAT AGT GAT GT T T T GAT T GC T G GAG GTAAT AT AGAGAGCAAA 

1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | | | | | | | | | || M | | 

AAAGAT AGT AAG GAAGAT AGT GAT AT GT T G G CT GCT GGAGGTAAAAT C GAGAG CAAC 


540 


Db 


1441 


1497 


Qy 


541 


T T G GAAG GT AAAGT G GAT AAGAAAC ACT T T T C AGAT AG C CT T GAACAAACAAAT C GT GAA 

1 N 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

T T G GAAAGT AAAGT G GATAAAAAAT GT T T T GC AGAT AG C CT T GAG CAAAC TAAT C AC GAA 


600 


Db 


1498 


1557 


Qy 


601 


AAAGAT AGT GAAAGCAGTAAT GAT GACACT T C ATTT C C CAGTACACCAGAAGCT GTAAGA 

1 1 1 M 1 1 1 1 1 1 1 | | | | | MMI II 1 1 1 1 1 1 1 | 1 1 1 1 1 | | 1 M 1 

AAAGAT AGT GAGAG TAGTAAT GAT GATACT T CT TT C C C CAGTAC GC CAGAAGGT ATAAAG 


660 


Db 


1558 


1617 



661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC— CAACAACTGAGAATGTT 717 
I I I I I I I I I I I M I I I I I I I I I II I I I I I I II I I | | | | | | | || | | || 
1618 GAT C GTT CAGGAGCATATAT CACAT GT GCT CCCTTTAACC CAGCAGCAACT GAGAGCATT 1677 

718 T C AAC AAAC AT TTTTCCCTTGTTG GAAGAT CAT AC T T C G GAAAAT AAG AC AG AT GAAAAA 777 

I I I I I I I I I I I I I M I I I I I I | | | | | | | | M | | | | | | | | | | | | | | 

1678 G C AAC AAAC AT TTTTCCTTTGT TAG GAG AT C C T AC T T C AG AAAAT AAG AC C GAT GAAAAA 1737 

778 AAGA T AGAAAAAAAAAGGCACAAAT T GTAACAGAGAAGAAT GCAAGT GT CAAGACAT C 835 
N I I I I M I I I M I I I I I I I I I I I I I I I I I I I | | M III I I I I I 

1738 AAAAAT AGAAGAAAAGAAGG C C CAAAT AGT AAC AGAGAAGAAT AC TAG C AC CAAAAC AT C 1797 

836 AAAC CC-TTTCCT TAT G G C AGC AC AG GAGT CTAAGACAGATT AC GT T ACAAC AGAT CAT G 8 94 
I I I I I I Ml Ml I I I I I I I I I I I I III | I I | || || | | || | | | | | | | | | || 
1798 AAAC C C T T T T AC T T GT AGC AGCAC AGGAT T C T GAGACAGATT AT GT C ACAAC AGAT AAT T 1857 

895 T GT CAAAGGT GAC C GAGGAAGT AGT GG CAAACAT G CC T GAAGGT C TAAC C C C AGATT T G G 954 
I I I I I I I I I I I I I II I I I I I I I II || | M I I I I I I I I II M || | | | | | | M I 
1858 T AACAAAGGT GACT GAG GAAGT C GT GGC AAACAT GCCT GAAGGC CT GAC T C C AGAT T TAG 1917 

955 TT CAG GAAGC AT GT GAAAGT GAAT T GAAT GAAGC T AC T G GT ACAAAAAT T GC C T T T GAAA 1014 

I I M I I I I I I I I I I M II MINIMI I I I I I I I I I I I I M | | | | | | | | | 

1918 T ACAGGAAGC AT GT GAAAGT GAAT T GAAT GAAGT T AC T G GT ACAAAGAT T G C T TAT GAAA 1977 

1015 C AAAAAT G GAC CT G GT T CAAACT T CAGAAGC TGTGCAGGAGTCACTTT AC CCTGT AAC AC 107 4 

I I I M I I I I I I I I M I M I M II I I I I I I | | | | | | | | | | || II I I I I I IN 
1978 C AAAAAT GGACTT G GTT CAAACAT CAGAAGTTAT GCAAGAGT CACTCT AT C CTGC AGCAC 2 037 

1075 AGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTG 1134 

I N N I II I II I I II I M I I I II I I I II II I II I Mill I I M I I II II M II I I 

2038 AG CT T T G C C CAT CAT T T GAAGAGT CAGAAGC TACT C CT T CAC CAGT T T T G C C T GAC ATT G 2 097 

1135 TCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCA 1194 

I II M II i M I I I I I I I I M M I I I I I II I II II I I I I I I I I I I I I I I I M 
2098 TTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCA 2157 

1195 GT TCAT CAC CAT T AGAAACT CT T C C T T CAGT TAAT TAT GAAAGCATAAAGT T T GAG C C T G 1254 

I I M I II I I I I I I | | | || I I I I II I || I I II I II I I II II I I I II I 

2158 GCT CAT CAC CAT T AGAAGC T T C T T CAGT TAAT TAT GAAAGC ATAAAAC AT GAGC C T G 2214 

1255 AAAAT C C C C CAC CAT AT GAGG AG G C CAT GAAT GT AT CAC T AAAAAAAGAAT CAG GAAT GA 1314 

UN I M I I I I I I I M M I I II I II II I I I II II I I || I II I I I I I I I I || II I I 
2215 AAAAC C C C C CAC CAT AT GAAG AG G C CAT GAGT GT AT CAC T AAAAAAAGT AT C AGGAAT AA 2274 

1315 AT GAAGAAAT C AC AGAG C C T GAAG GT AT TAGT GT AG CT GTT CAG GAAAC AGAAGC T C C T T 1374 

• I I I I II M I II I I II I II I I I I I II I M I II I I I I 

2275 AGGAAGAAAT T AAAGAG C C T GAAAAT AT TAAT G CAG C T C T T C AAGAAAC AGAAG C T C C T T 2334 

1375 AT AT AT CTATT GCATGT GATTTAATTAAAGAAACAAAGATCTCTACT GAACCGACT C CAG 1434 

I I I I I I I I I M M I I I I I I I I II I I || I II I II I I I I I | Ml M I I I II I I I I I 
2335 AT AT AT CT AT T GC AT GT GATT TAATT AAAGAAACAAAGC T T T CT GC T GAAC CAG CT C C G G 2394 

1435 AT T T C T C TAGT TAT T C AG AAAT AG C AGAAGT T G CAC AG C CAGT GC C C GAG CAT T C T GAG C 14 94 

N I M I M M I M I I I I I I I | | | | || M I I II I I I I I I I I I II I I | || I I I I I 
2395 AT T T CT C T GAT TAT T C AGAAAT G G C AAAAGT T GAAC AG C CAGT GCCT GAT CAT T C T GAG C 2454 

1495 T AGT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GAC T TAT T TAGT GAT GAT T CAAT AC 1554 



2455 TAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATAC 2514 
1555 CCGAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAA 1614 

I 1 I I I I M I I I I I I I I I I I I I I I I I I I || MM MM M I M I I I I 

2515 CTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGA 2574 
1615 TTT CAT C T GAGT C AAT GACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCAC 1674 

I I I I I II I I I I I M II M I MM I I I II II I I I I I II II M II I I I I I 

2575 CTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CAC 2 631 

1675 CTGAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATA 1734 

I > I I I I I I I I I I M II I I II III I M I II II II I I II I 

2632 CTGAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATA 2 691 

1735 CCTTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGG 1794 

1 I ' N I I I I I I I I II I M I I II I I I I I II M I I I I I I I I I 

2692 CCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGG 2751 

1795 AGGAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACC 1854 

Nil I I II I II I I I I I M II I II II I I I I I I Ml M II II I I I I I 

2752 AGGAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGA 2811 

1855 TAAGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGA 1914 

I I I I I I I I I I I I I I I I I I I I I I I M II I I I II I || | M I II I I II I II I I 

2812 TAAGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTA 2871 

1915 CCTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAG 1974 

1 'I I I I I I I I I M I I I | Ml II II I II I I I I | | | || I I M 

2872 CATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAG 2931 

1975 AAGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTG 2034 

''''' I N I H I II II I I I M I II I I II I II II M II I M M M I I 

2932 AAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCA 2991 

2035 CAGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTC 2088 

IN N I I I M II II II I I I M I M MM I II I II M I II II I II I 

2992 CAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCA 3051 

2089 ATGTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTC 2148 

' H I I N I I I II M Ml I I || | | I I I I 

3052 GTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTC 3111 

2149 CAGATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTC 2208 

I I I I I I I II M I I I I I I I I II I I II II II M I I II II II I I II II I II II I I I 
3112 CAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTC 3171 

2209 TTGTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCAT 2268 

I I I I I I I I I I I I I I I I I I I I I M I I I I I II II II I I I II Mill MM 

3172 TTGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCAT 3231 
2269 CTGCTATATTTTCAGCAGAGCTGAG-TAAAACTTCAGTTGTTGACCTCCTCTACTGGAGA 2327 

I I I M I I I I I II I I I I I I I I I I I M II II II I II I II Mill 

3232 CTGCTATATTTTCAGCAGAGCTGAGCTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGA 3291 

2328 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACA 2387 
1 I I I I I I I I I I I I I I I I I I M I I I II II I I I I I || | | I I I | | | || | M II I II I I 



3292 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACA 3351 

2388 GTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATC 2447 

N I I I I I M I I I I I I I I MIM I I I I I || I I M I I II I I I I I I I | I I | | | | | || |M 
3352 GTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATC 3411 

244 8 AGCT T TAG GAT AT ATAAGG GT GT GAT C C AG G C TAT C CAGAAAT C T GAT GAAG G C CAC C C A 2507 

I N I I I I I I I I I I I I I I I I I I I I I I I I I || I I I I I | || || | | | | | | | | | 

3412 AG C T T TAG GAT AT ACAAG GGT GT GAT C CAAG CT AT C CAGAAAT C AGAT GAAGGC CAC C C A 3471 

2 508 T T CAG GG C ATAT T T G GAAT CT GAAGTT GC TAT AT C T GAG GAGT T GGT T C AGAAGTAC AG C 25 67 

I Ml I II I I I I I I | | | | | | | | | | | | | | || | | | | | | | | | | | | | | | | || | | | 

3472 TT CAGG GCAT AT C T G GAAT C T GAAGT T GC T AT AT C T GAG GAGT T G GT T CAGAAGT AC AGT 3531 

2568 AATT CT GCTCTTGGT CAT GT TAACT G C ACAATAAAAGAAC T C AGAC G C C T C T T C TT AGT T 2627 

I I I I N I I I I I I I I I I I I I I | | | | | | M | | | | I I I I I I I I I I I I I | | | || | | | | | | 
3532 AATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTT 3591 

2628 GAT GAT TT AGT T GAT T CT C T GAAGT T T G CAGT GTT GAT GT G GGT AT T T AC CT AT GT T GGT 2 687 

N i I I I M I I II I I I I I I I I I I | | | | | || | M | | | | | | | | | | | | | | | | | || | | | | | | | | , 
3592 GAT GAT TT AGT T GAT T C T C T GAAGT T T GCAGT GTT GAT GT GG GT AT T T AC C TAT GT T G GT 3651 

2688 GCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCT 2747 

M I I M I I I M II I I I I I I II M I I I II I I I II II M | | M I I I I 

3652 GCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCT 3711 

2748 GT TATTTATGAACGGCAT CAGGC GCAAATAGAT CATTAT CT GGGACTT GCAAATAAGAAT 2807 

N M II I I I M I I I I I I I I I I I I I I || || | | | | |M II I I I I I II I 

3712 GTTATTTAT GAACGGC AT CAGGCACAGATAGAT CATTAT CTAGGACTT GCAAATAAGAAT 3771 

2 8 08 GT TAAAGAT GC TAT GGCT AAAAT C CAAG CAAAAAT C C CT GGAT T GAAG C GTAAAG CT GAA 2 8 67 

I I I I I I I I I I I I I I I I | | | | | | | M I I I I I I I I I I I I I | | | M I I I I I I I I I I 

3772 GT TAAAGAT G C TAT GGCT AAAAT C CAAG CAAAAAT C C C T GGAT T GAAGC GCAAAG C T GAA 3831 

2868 T GAGAAAGCCT GAAAGAGTTAACAATAGAGGAGTTTAT CTTTAAAGGGGAT ATT CATTT G 2927 

1 I I I I I M M II I I I II I I I I || | | | | | | | | || | || | | | || 

3832 T GAAAACGC C CAAAATAATT A GTAGGAGTT CAT CTTTAAAGGGGATATT CATTT G 3886 

2928 AT T C CAT T G GGGAGGGT CAG GGAAGAACAAAGC C T T GAC AT T GCAGT GC AGT T T CAC 2 984 

IN I I I I I II I I I I I I I I I I I I I MM 

3887 AT TAT AC G G GG GAGGGT CAG GGAAGAAC GAA- C C T T GAC GT T GCAGT GCAGT T T C ACAGA 3945 

2985 AGAT CTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAAT GAC CTGTCTTGA 3035 

I M II I I I II I II I || I | || | || | || M I II I II I II I || I | I M | 
3946 TCGTTGTTAGATCTTTATTTTTAGCCATGCACT GTT GTGAGGAAAAATT AC CTGTCTTGA 4005 

3036 CTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAAT 3095 

I I I I I I I I I I I I I I I I I I I II II I II I M I II II I I II II || II I I II I I II I II I II 
4006 CTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAAT 4065 

3096 CAT AT TTGTTTTTCCT GT AT GAGGC ACT GGT GAAT AAACAAAGAT CT GAGAAAGC T GT AT 3155 

I M I I II I I || || | | | | | || || || I I I I I I I II I II I I I 

4066 CATA— TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTAT 4112 

3156 ATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

>'l I I I I I M II I I I I || | | || | | | | | | | | | | || || || | || || | || | 
4113 ATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4172 



Qy 3215 GAC AG AAAT AAC C C T T T T C AC AGT T T GT G C ACT GT GT AC G GT C T GT GT AG GT 32 66 

I I Ml II I II I I I || I | | || | M | | | | | | | | | | | | | | | | 
Db 4173 AGAAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGAT 4232 

QY 32 67 T GAT GCAGAT T T T CT GAAAT GAAA TGTTTAGACGAGATCATGCCACCAAGGCAGGA 3322 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I || I I 

Db 4233 T GAT GCAGAT T T T C T GAAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C G GTAAAG CAGG A 4292 

Qy 3323 GTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGT GAAAT TTACTGTTGT ATT 3382 

IN M I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4293 ATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATT 4351 

Qy 3383 AAT T GC CAAT ATAAGTAAAT AT AGATT AT AT AT AT CT AT AT ATAGTGT T T C AC GAAGC T T 3442 

I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I | | | | 

Db 4352 AATT GCCAAT ATAAGTAAAT AT AGATTAT AT AT G T AT AGT GT T T C AC AAAG C T T 4405 

Qy 34 43 AGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTT 34 9 6 

M I II II I I I I I I I I I I I | | M | | | | | | | || | || | | | | | | || | | | | 

Db 4406 AGAC CT T T AC C T T - C C AG C C AC C C CAC AGT G CT T GAT AT T T C AGAGT C AGT CAT T GGT T A 4464 

QY 34 97 TAT GT GT GT AGT C C CAAAG CACATAAG C T AG GGAGAAAC GT AC T T CT AG G C G CAC T AC C A 3556 

N I I I I I | || | | | | | M M I I II I I I I I I I I I I I M I I I I I 

Db 44 65 T ACAT GT GT AGT T C CAAAG CACATAAG C T AGAAGAAGAAAT AT T T CT AGG AG CAC T AC C A 452 4 

Qy 3557 T C T GT T T T CAACAC GAAC C GAC GC CAT G CAAAC AGAACT C C - T CAAC ATAAAC T T CAC T G 3615 

I I I I I I I I I I I I I I I I Mill II I II I I I I M | | M || || | || | | | 
Db 4525 TCTGTTTTCAACATGAA AT GC CACACACAT AGAACT C CAACAACAT CAATTT C AT T G 4581 

Qy 3616 CAC AGAC T T AC T GT AGT TAAT T T TAT CAC — AAACTCTGGACTGAATCTAATGCTTCCAA 3673 

I I M I I I I I I I M I I I II I I I I I MM II II I I I I I I M II I I I II II II I I II 
Db 4582 C AC AGACT GACT GT AGT TAATT TT GT C AC AGAAT C TAT G GACT GAAT CTAAT GC T T C CAA 4641 

Qy 3674 AAA T GTTT GCAAATATCAAACATT GTTAT GTAAGAAAATAT AAAT 3718 

I I I I I I M M M M I II II I II I II I I II I I I I I M II I MM 

Db 4642 AAAT GTTGTTTGTTT GCAAAT AT CAAAC AT T GT T AT G CAAGAAAT TAT TAAT T AC AAAAT 4701 

Qy 3719 GAC GAT T T AT ACAAT T GT G GT T TAAGCT GT AT T GAAC T AAAT C T GT G GAAT GC AT T GT GA 377 8 

II I I I II II M I II II I I I I I I II I I I I I I I II II I I I I I I I I I M || I || | M I II 

Db 47 02 GAAGAT T TAT AC CAT T GTG GT T T AAG CT GT AC T GAAC T AAAT C T GT GGAAT GC AT T GT GA 4 7 61 

Qy 3779 AC T GT AAAAG C AAAGT AT C AAT AAAGC T T AT AG ACT T AAAAAAAAAAAAAAAAAA 38 33 

I M I I I I I I I I I II II II II II I I II M I II II I I I II I II I I || | 

Db 4 7 62 AC T GTAAAAGC AAAGT AT CAAT AAAGC T TAT AG ACT TAAAANGNNANNAGN GAAA 4816 



RESULT 2 

US-09-023-655-382 

; Sequence 382, Application US/09023655 
; Patent No. 6607879 
; GENERAL INFORMATION: 

APPLICANT: Cocks, Benjamin G. 
; APPLICANT: Susan G. Stuart 

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF BLOOD CELL GENE 
; TITLE OF INVENTION: EXPRESSION 
NUMBER OF SEQUENCES: 1508 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

STREET: 3174 PORTER DRIVE 
; CITY: PALO ALTO 

STATE: CALIFORNIA 

COUNTRY: USA 
; ZIP: 94304 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/023,655 

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 
; FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 
; REFERENCE/DOCKET NUMBER: PA-0001 US 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
INFORMATION FOR SEQ ID NO: 382: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2610 base pairs 
; TYPE: nucleic acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

IMMEDIATE SOURCE: 

LIBRARY: LUNGNOT14 

CLONE: 1508778 
US-09-023-655-382 

Query Match 20.6%; Score 789.8; DB 4; Length 2610; 

Best Local Similarity 84.5%; Pred. No. 5.2e-190; 

Matches 1021; Conservative 0; Mismatches 138; Indels 49; Gaps 10; 
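
The Best Local Similarity figure reported with each result appears to be the number of matches divided by the total number of alignment columns (matches + mismatches + indels). The following is a minimal Python sketch under that assumption; the definition is inferred from the reported counts rather than stated in the listing, and the function name is hypothetical.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of alignment columns that are exact matches (assumed definition)."""
    columns = matches + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / columns


# Counts reported for RESULT 2 in this listing.
print(f"{best_local_similarity(1021, 138, 49):.1f}%")  # prints 84.5%
```

The same arithmetic reproduces the 92.9% reported for RESULT 3 (643 matches, 42 mismatches, 7 indels).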
Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I M I I 
Db 1311 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 1370 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTAC-ATTGC 2422 
Db 1371 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACAATTGC 1430 

Qy 2423 CTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTAT 2482 
Db 1431 CTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTAT 1490 

Qy 2483 CCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATC 2542 
Db 1491 CCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATC 1550 


Qy 2543 TGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAA 2602 
I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1551 TGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAA 1610 

Qy 2603 AGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTT 2662 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1611 GGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTT 1670 

Qy 2663 GATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGC 2722 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1671 GATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGC 1730 

Qy 2723 TCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCA 2782 
Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I II I I I I I I I I 
Db 1731 TCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCA 1790 

Qy 2783 TTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 2842 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1791 TTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 1850 

Qy 2843 CCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTT 2902 
I M I II I I I I I I I I II I I I I II I I I I I I I I I I II I I I I I I I I I I I 
Db 1851 CCCTGGGTT GAAGC GCAAAGCT GAAT GAAAAC GC C CAAAATAATT A GTAGGAGTT 1905 

Qy 2903 TATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCT 2962 
I I I I I I I I I I I I I I M I I I I I I I I I II I I I I M III I I I M 
Db 1906 CATCTTTAAAGGGGATATTCATTTGATTTATACGGGGGAGGGTCAGGGAAGAACGAACCT 1965 

Qy 2963 T GACAT T G CAGT GCAGT TT CAC AGAT C T T TAT TT T TAG CAAC G C AGT G- 3010 
I I M I I I I I I I I I I I I I I I I I II I I I I I I I II I II I I I I Ml || 
Db 1966 TGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGT 2025 

Qy 3011 TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCT 3070 
I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I II I I I I I I I I I I I I 
Db 2026 TGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCT 2085 

Qy 3071 GCTATGTATGGATTTAAATCGTAATCATATTTGTTT--TTCCTGTATGAGGCACTGGTGA 3128 
I I I II I I I I II I II I I I I I I I I I I I I I II I II I I || I I I I I | I | | I I I || 
Db 2086 GCTATGTATGGATTTAAACCGTAATCATATCTTTTTCCTATCTATCTGAGGCACTGGTGG 2145 

Qy 3129 ATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT- 3187 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2146 A ATAAAAAAC C T GT AT AT T T T ACT TT GT T GC AGAT AGT C T T GC C GC AT C 2194 

Qy 3188 TTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAAT AAC C CT T T T CAC AGT T T 3240 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I 
Db 2195 TTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAANAAAGAAAAAGAGCCCTTTTCAGTTT 2254 

Qy 3241 GT G CACT GT GT AC GGT C T GT GT AG GT T GAT G C AGAT T T T C T GAAAT GAAA TGTTTA 3296 
I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 2255 GTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTA 2314 

Qy 3297 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 3356 
M I I I I I I I I I I I I | Ml | I I I | | | | | M | I I I I I I I I 
Db 2315 GACGAGATCATACCCGGTAAAGCAGGAATGACAAAGCTTGCTTTTCTGGTATGTTCTAGG 2374 



Qy 3357 
Db 2375 

Qy 3417 
Db 2431 

Qy 3477 GATACTTC 3484 
Db 2489 




RESULT 3 
US-08-700-607-2 

; Sequence 2, Application US/08700607 

; Patent No. 5858708 

; GENERAL INFORMATION: 

; APPLICANT: Bandman, Olga 

; APPLICANT: Au-Young, Janice 

APPLICANT: Goli, Surya K. 

APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 
; COUNTRY: U.S. 

ZIP: 94304 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 799 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 

LIBRARY: 
; CLONE: Consensus 

US-08-700-607-2 



Query Match 15.4%; Score 590.8; DB 2; Length 799; 

Best Local Similarity 92.9%; Pred. No. 7e-140; 

Matches 643; Conservative 0; Mismatches 42; Indels 7; Gaps 2; 



Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 II | | | | || | || | || | M 1 1 1 1 II 1 1 1 1 | 1 1 | | 1 1 1 1 | | | | | | | | 
Db 108 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 167 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 
1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 168 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 227 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 
1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 M | 1 1 1 1 1 1| 1 1 1 1| 1 1 1 1 1| | | 
Db 228 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 287 

Qy 2484 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 i ii i ii 1 1 1 1 ii ii 1 1 ii i ii ii 
Db 288 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 347 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 ii 1 1 1 1 1 
Db 348 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 407 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 ii 1 1 1 1 1 1 1 1 
Db 408 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 467 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 
1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 
Db 468 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 527 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 
ii 1 1 1 1 1 ii i ii 1 1 1 1 1 1 1 ii 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 1 
Db 528 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 587 

Qy 2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2843 
HIM MINI 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I | | | | M 
Db 588 TATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 647 

Qy 2844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903 
1 M 1 II 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 II 1 1 1 1 1 
Db 648 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 702 

Qy 2904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2963 
1 N 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 703 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGA--CCTT 760 

Qy 2964 GACATTGCAGTGCAGTTTCACAGATCTTTATT 2995 
II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 
Db 761 GACGTTGCAGTGCAGTTTCACAGATCGTTGTT 792 





RESULT 4 

US-09-621-976-3839 

; Sequence 3839, Application US/09621976 



; Patent No. 6639063 
; GENERAL INFORMATION: 

; APPLICANT: Dumas Milne Edwards, J.B. 

; APPLICANT: Jobert, S. 

; APPLICANT: Giordano, J.Y. 

; TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
; FILE REFERENCE: GENSET.054PR2 
; CURRENT APPLICATION NUMBER: US/09/621,976 
; CURRENT FILING DATE: 2000-07-21 
; NUMBER OF SEQ ID NOS: 19335 
; SOFTWARE: Patent.pm 
; SEQ ID NO 3839 

; LENGTH: 536 
; TYPE: DNA 
; ORGANISM: Homo sapiens 
; FEATURE: 
; NAME/KEY: CDS 
; LOCATION: 3..224 
US-09-621-976-3839 



Query Match 7.7%; Score 293.4; DB 4; Length 536; 

Best Local Similarity 82.8%; Pred. No. 1.1e-64; 

Matches 443; Conservative 0; Mismatches 56; Indels 36; Gaps 8; 

Qy 3291 TGTTTAGACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGT 3350 
I M I I I I I I I I I I I I I I II II HUM III I I I I I I I I I Ml I I I I I I I | | 
Db 13 TGTTTAGACGAGATCATACCGGTAAAGCAGGAATGACAAAGCTTGCTTTT-CTGGTATGT 71 

Qy 3351 TCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTA 3410 
I I I I I I I I I I I I I I I I I I I I I II I I I I I II I II I I I I I I I I || || | | | | | | | | | | || 
Db 72 TCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGTAAATATAGATTA 131 

Qy 3411 TATATATCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACA 3470 
II I I I M I I I M I I I I I I II I II I | | || M II II I I I I I II I I II I | | 
Db 132 TA T AT GTAT AGT GT T T C ACAAAG CT T AGAC CT T TAC CT T - C C AGC C AC C C C AC A 184 

Qy 3471 GTGCTTGATACT- T C T GT CAT GG GT T T TAT GT GT GT AGT C C CAAAG C ACAT AAG C 3524 
I I I I I I I I I I I I I M I I I I I I I II M I II I I I II I I I I I 
Db 185 GTGCTTGATATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCAAAGCACATAAGC 244 

Qy 3525 TAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATG 3584 
HI M I I I I I I I I I I | || | | | | | | | | | | | | | | | | || | | | || 
Db 245 T AG AAGAAGAAAT AT T T C TAG GAG C AC TAC CAT C T GT T T T C AAC AT GAAAT G C C A 299 

Qy 3585 CAAACAGAACTCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCAC 3644 
II 'I' I I I M I I I I I I I I I M I 1 I II I I I I I I I I II I I II II I I I I I I 
Db 300 CACACATAGAACTCCAACATCAATTTCATTGCACAGACTGACTGTAGTTAATTTTGTCAC 359 

Qy 3645 — AAACTCTGGACTGAATCTAATGCTTCCAAAAATGTT T GCAAAT AT CAAAC A 3695 
M II I I I I I I I I I H I I I M I I II I II I I I I I I I I I I I I I I I I I I I I I I 
Db 360 AGAATCTATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTGCAAATATCAAACA 419 

Qy 3696 T T GT T AT GT AAGAAAAT AT AAAT GAC GAT T T AT ACAAT T GT G GT T T AAGC T G 3747 
I I I M I I I I I I I I I I M M I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 420 TTGTTATGCAAGAAATTATTAATTACAAAATGAAGATTTATACCATTGTGGTTTAAGCTG 479 

Qy 3748 TATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAA 3802 
I I I N I I II I I I I I I I I I I I I I I | | | | | | | | | | | m I I I I I | | | | | | | | | | M I 
Db 480 TACTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAA 534 

RESULT 5 

US-09-621-976-3840 

Sequence 3840, Application US/09621976 
Patent No. 6639063 
GENERAL INFORMATION: 
APPLICANT: Dumas Milne Edwards, J.B. 
APPLICANT: Jobert, S. 
APPLICANT: Giordano, J.Y. 

TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET.054PR2 
CURRENT APPLICATION NUMBER: US/09/621,976 
CURRENT FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS: 19335 
SOFTWARE: Patent.pm 
SEQ ID NO 3840 
LENGTH: 508 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/KEY: CDS 
LOCATION: 3..224 
US-09-621-976-3840 

Query Match 6.6%; Score 254.4; DB 4; Length 508; 

Best Local Similarity 81.5%; Pred. No. 8.2e-55; 

Matches 404; Conservative 0; Mismatches 56; Indels 36; Gaps 8; 
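
The three summary lines that precede each alignment share a "Name value;" layout, so they can be read mechanically. The following is a minimal Python sketch assuming that layout holds for every result in the listing; `parse_summary` and `FIELD` are hypothetical names used for illustration and are not part of the search software.

```python
import re

# One "Name value;" pair, e.g. "Query Match 6.6%;" or "Pred. No. 8.2e-55;".
FIELD = re.compile(r"\s*([A-Za-z][A-Za-z. ]*?)\s+([-+0-9.eE]+)%?;")


def parse_summary(text: str) -> dict[str, float]:
    """Map each summary field name to its numeric value."""
    return {name: float(value) for name, value in FIELD.findall(text)}


# Summary lines copied from RESULT 5 in this listing.
summary = """
Query Match 6.6%; Score 254.4; DB 4; Length 508;
Best Local Similarity 81.5%; Pred. No. 8.2e-55;
Matches 404; Conservative 0; Mismatches 56; Indels 36; Gaps 8;
"""
stats = parse_summary(summary)
print(stats["Score"], stats["Pred. No."], stats["Indels"])
```

Percent signs are dropped, so percentage fields such as Query Match come back as plain numbers (6.6, not 0.066).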

Qy 3291 TGTTTAGACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGT 3350 
1 M I I I M I I MINI I I I I II I I I | || | M I I I I I I I I I 
Db 13 

Qy 3351 
II I N I I I I I I I I I. I I I I I II I II I I I I I I I I I II I I 
Db 72 

Qy 3411 
II I I I I I I I I I I I I I M I I I I I I | | || | | | | | | | | | | | | | | || 
Db 132 TA TAT GT AT AGT GT T T C ACAAAG C T T AGAC C T T T AC CT T - C C AGC C AC C C C AC A 184 

Qy 3471 GTGCTTGATACT T CT GT CAT GGGTTT T AT GT GT GT AGT C C CAAAGCAC AT AAGC 3524 
I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I 
Db 185 GTGCTTGATATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCAAAGCACATAAGC 244 

Qy 3525 TAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATG 3584 
HI II M I I I I II I I I I M I | | | || | | M | | || | | | || | || 
Db 245 T AGAAGAAGAAAT AT T T C T AGGAGCAC TAC CAT C T GT T T T CAAC AT GAAAT G C C A 299 

Qy 3585 CAAACAGAACTCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCAC 3644 
' ' 'I' I I I I I I I I M I I I I I I I I I || || | | | | | | | | | | || | | | | | | | | 
Db 300 CACACATAGAACTCCAACATCAATTTCATTGCACAGACTGACTGTAGTTAATTTTGTCAC 359 

Qy 3645 
II II I I I I I I I II I I I I II I I I I I Mill I I I I I I I 
Db 360 AGAATCTATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTGCAAATATCAAACA 419 

Qy 3696 TTGTTATGTAAGAAAATAT AAAT GAC GATT T AT ACAAT T GT G GT T TAAGCT G 3747 
I I I I I I I I 111,1' Ml I II I II MINIMI I I M I I M I M M I II 
Db 420 TTGTTATGCAAGAAATTATTAATTACAAAATGAAGATTTATACCATTGTGGTTTAAGCTG 479 

Qy 3748 TATTGAACTAAATCTG 3763 
II II II II I II II II 
Db 480 TACTGAACTAAATCTG 495 



RESULT 6 

US-09-149-476-254 

; Sequence 254, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 

; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/04493 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,626 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,334 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,336 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,163 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/047,600 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,615 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,597 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,502 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,633 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,583 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,617 

; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,618 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,503 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,592 

; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 



EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 



; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,578 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/043,576 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/047,501 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,670 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/056,632 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,664 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,876 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,881 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,909 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,875 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,862 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,887 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,908 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/048,964 

; EARLIER FILING DATE: 1997-06-06 

; EARLIER APPLICATION NUMBER: 60/057,650 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/056,884 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/057,669 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

; EARLIER APPLICATION NUMBER: 60/061,060 

EARLIER FILING DATE: 1997-10-02 

Query Match 6.0%; Score 230.4; DB 4; Length 1766; 

Best Local Similarity 63.6%; Pred. No. 1.8e-48; 

Matches 351; Conservative 0; Mismatches 201; Indels 0; Gaps 0; 

Qy 2300 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 2359 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 286 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 345 

Qy 2360 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACAT 2419

II I I I I I I II I I I I I I I I I I I Mill I II I I I I I I I I I 

Db 346 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 405

Qy 2420 TGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGC 2479 

I M I I II I II I I I I I II I I I I I I II I I I I I II III II I I I I I II 



Db 406 CATCCTGGCTCTTCTCTCTGTCACCATCAGCTTCAGGATCTACAAGTCCGTCATCCAAGC 465

Qy 2480 TATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTAT 2539

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 466 TGTACAGAAGTCAGAAGAAGGCCATCCATTCAAAGCCTACCTGGACGTAGACATTACTCT 525

Qy 2540 ATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAAT 2599

II II I I I I I I I I I I M I I I I II II I I I I I I I 

Db 526 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 585

Qy 2600 AAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGT 2659

III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 586 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 645

Qy 2660 GTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2719 

I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I II I 
Db 646 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 705

Qy 2720 GGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGA 2779

III I I I I II I I I I I I I I I I I I I I II I I I I I I I I I 

Db 706 TGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATTGA 765

Qy 2780 TCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAA 2839

I II I I I I I I I II I I II III I I I I II I I I I I I I 

Db 766 TCACTATGTTGGCATCGCCCGAGATCAGACCAAGTCAATTGTTGAAAAGATCCAAGCAAA 825

Qy 2840 AATCCCTGGATT 2851

I I I I I I I I I I 
Db 826 ACTCCCTGGAAT 837
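
The Best Local Similarity figures in these summaries appear to equal the exact-match count divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). That reading is inferred from the reported numbers; the report itself does not state the formula, and the function name below is illustrative only. A minimal Python sketch that checks the inference against the counts reported for the hits in this section:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of alignment columns that are exact matches (assumed formula)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# (Matches, Conservative, Mismatches, Indels, reported percentage),
# copied from the summary lines of Results 6/7, 8, 9, 10 and 11/12.
reported = [
    (351, 0, 201, 0, 63.6),
    (303, 1, 172, 0, 63.7),
    (333, 6, 201, 4, 61.2),
    (182, 0, 21, 0, 89.7),
    (253, 0, 41, 21, 80.3),
]

for m, c, mm, ind, pct in reported:
    # Each reported value should be reproduced after rounding to one decimal.
    assert round(best_local_similarity(m, c, mm, ind), 1) == pct
```

Under this reading, Gaps counts runs of consecutive indel columns and does not enter the percentage.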



RESULT 7 

US-09-149-476-255 

; Sequence 255, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.

; TITLE OF INVENTION: 186 Human Secreted proteins
; FILE REFERENCE: PZ002P1

; CURRENT APPLICATION NUMBER: US/09/149,476
; CURRENT FILING DATE: 1998-09-08
; EARLIER APPLICATION NUMBER: PCT/US98/04493
EARLIER FILING DATE: 1998-03-06 
EARLIER APPLICATION NUMBER: 60/040,162 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 



EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 



EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 



; EARLIER FILING DATE: 1997-09-05

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

; EARLIER APPLICATION NUMBER: 60/061,060 

; EARLIER FILING DATE: 1997-10-02 

Query Match 6.0%; Score 230.4; DB 4; Length 2664; 

Best Local Similarity 63.6%; Pred. No. 2.2e-48; 

Matches 351; Conservative 0; Mismatches 201; Indels 0; Gaps 0; 

Qy 2300 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 2359

I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 261 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 320

Qy 2360 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACAT 2419 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 321 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 380

Qy 2420 TGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGC 2479 

I I I I I II I I I I I I I I II II I I I I I I I I I I I II III II II I I I II 
Db 381 CATCCTGGCTCTTCTCTCTGTCACCATCAGCTTCAGGATCTACAAGTCCGTCATCCAAGC 440

Qy 2480 TATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTAT 2539

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I MM M I I I I I 

Db 441 TGTACAGAAGTCAGAAGAAGGCCATCCATTCAAAGCCTACCTGGACGTAGACATTACTCT 500

Qy 2540 ATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAAT 2599

II II I I I I I I I II I I I I I I I II II I II I I I I 

Db 501 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 560

Qy 2600 AAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGT 2659

III II I I I I II I I I I I I I II I I I I II I I I I I I I I II I I 

Db 561 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 620

Qy 2660 GTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2719 

I I I II I I I I I I I I II I I I I M I I I I I I I I I I I II II M I I I I I 
Db 621 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 680 

Qy 2720 GGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGA 2779

III I I I II I II I I II II I I I I I I II II I II M II 

Db 681 TGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATTGA 740

Qy 2780 TCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAA 2839

I II I I I I I I III I I II III I I I I I II I I I I I I 

Db 741 TCACTATGTTGGCATCGCCCGAGATCAGACCAAGTCAATTGTTGAAAAGATCCAAGCAAA 800

Qy 2840 AATCCCTGGATT 2851

I I II I M I I I 
Db 801 ACTCCCTGGAAT 812



RESULT 8 
US-08-700-607-4 

; Sequence 4, Application US/08700607 

; Patent No. 5858708 

; GENERAL INFORMATION: 

; APPLICANT: Bandman, Olga 



APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
; NUMBER OF SEQUENCES: 9 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 
; TELEFAX: 415-845-4166 

; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 1095 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01
; CLONE: 31870 

US-08-700-607-4 

Query Match 5.3%; Score 202.2; DB 2; Length 1095; 

Best Local Similarity 63.7%; Pred. No. 1.9e-41; 

Matches 303; Conservative 1; Mismatches 172; Indels 0; Gaps 0; 
Qy 2300 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 2359

Db 328 [sequence missing] 387

Qy 2360 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACAT 2419

Db 388 [sequence missing] 447

Qy 2420 TGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGC 2479

Db 448 [sequence missing] 507

Qy 2480 TATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTAT 2539

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I

Db 508 TGTACAGAAGTCAGAAGAAGGCCATCCATTCAAAGCCTACCTGGACGTAGACATTACTCT 567



Qy 2540 ATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAAT 2599

II II I I I II I I I I I I I I I I I II II I I I I I I I 

Db 568 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 627

Qy 2600 AAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGT 2659 

III II I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I 

Db 628 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 687

Qy 2660 GTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2719 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 688 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 747

Qy 2720 GGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAA 2775 

III I I II I I I I I I I II I III I II III III 

Db 748 TGCTGAACTGCTCATTTTNAGTGTCCCGATTGTNTATNAGAAGTACAAGGTTCCAA 803



RESULT 9 

US-09-149-476-102 

; Sequence 102, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.

TITLE OF INVENTION: 186 Human Secreted proteins 

FILE REFERENCE: PZ002P1 
; CURRENT APPLICATION NUMBER: US/09/149,476
; CURRENT FILING DATE: 1998-09-08
; EARLIER APPLICATION NUMBER: PCT/US98/04493
; EARLIER FILING DATE: 1998-03-06 
; EARLIER APPLICATION NUMBER: 60/040,162 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,502 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 



EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 



; EARLIER APPLICATION NUMBER: 60/047,586 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,590 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,594 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,578 

EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/043,576 
; EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/047,501 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/043,670 
; EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/056,632 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,664 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,876 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,881 
; EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,909 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,875 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,862 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,887 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,908 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/048,964 
; EARLIER FILING DATE: 1997-06-06 
; EARLIER APPLICATION NUMBER: 60/057,650 
; EARLIER FILING DATE: 1997-09-05 

EARLIER APPLICATION NUMBER: 60/056,884 
; EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/057,669 

EARLIER FILING DATE: 1997-09-05 

EARLIER APPLICATION NUMBER: 60/049,610 
; EARLIER FILING DATE: 1997-06-13 
; EARLIER APPLICATION NUMBER: 60/061,060 
; EARLIER FILING DATE: 1997-10-02 

Query Match 4.7%; Score 182; DB 4; Length 794; 

Best Local Similarity 61.2%; Pred. No. 2.2e-36; 

Matches 333; Conservative 6; Mismatches 201; Indels 4; Gaps 3; 

Qy 2300 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 2359 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 



Db 253 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTG--GA 310



Qy 2360 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCC-TACA 2418 

II II II I I I I I II II 111 II I I II I I I I I : I II Ml: 
Db 311 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCARTGTGGGTTTCTTAMC 370

Qy 2419 TTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGG 2478

I I I I I I II I I I I I I I I II 1111:111 I I I I I II III II I I I I I I 
Db 371 TCATCCTGGCTCTTCTCTCTGTCACCATCARCTTCAGGATCTACAAGTCCGTCATCCAAG 430 

Qy 2479 CTATCCAGAAATCTGATGAAGGCCACCCATT-CAGGGCATATTTGGAATCTGAAGTTGCT 2537

I I I I II I I : I I I I : I I I I I i I I I I : I I I I I I I I I I I II I I I I 

Db 431 CTGTWCAGAARTCAGAARAAGGCCATCCAWTCCAAAGCCTACCTGGACGTAGACATTACT 490

Qy 2538 ATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACA 2597 

III II I I I I I I I I I I II I I I I II II Mill I 

Db 491 CTGTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCC 550

Qy 2598 ATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCA 2657

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 551 CTGAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCT 610 

Qy 2658 GTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATT 2717 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 611 GTCTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATT 670 

Qy 2718 TTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATA 2777

I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I 

Db 671 CTTGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATT 730

Qy 2778 GATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCA 2837

M I I I I I I I I I III I I M Ml M MM I 

Db 731 GATCACTATGTTGGCATCGCCCGAGATCAGACCAAGTCAATTGTTGAAAAGATCCCAAGC 790

Qy 2838 AAAA 2841 

I I I I 

Db 791 AAAA 794 



RESULT 10 
US-08-700-607-9 

; Sequence 9, Application US/08700607 
; Patent No. 5858708 

GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
; APPLICANT: Au-Young, Janice 

APPLICANT: Goli, Surya K. 

APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

; STREET: 3174 Porter Drive 

CITY: Palo Alto 
STATE: CA
COUNTRY: U.S. 
ZIP: 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 261 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 

LIBRARY: SPLNFET01 
CLONE: 28742 
US-08-700-607-9 

Query Match 4.5%; Score 174.2; DB 2; Length 261; 

Best Local Similarity 89.7%; Pred. No. 1.2e-34; 

Matches 182; Conservative 0; Mismatches 21; Indels 0; Gaps 0; 

Qy 2363 CTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGC 2422 

I I I || (Mill II I II I I I I I II I I I II I I I I M I I I I I I I II I I I M I I I 
Db 1 CCTATNCCNGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGC 60

Qy 2423 CTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTAT 2482

Ml I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I 
Db 61 CTTNGCCCTGCNCTCTGTGACCATCAGCTNTAGGCTATACT^AGGGTGTGATCCAAGCTAT 120 

Qy 2483 CCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATC 2542

I I I I I I I I I I 1 I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 1 I I I I 
Db 121 CCAGAAATCAGATGAAGGNCACCCATTCAGGGCATATCTGGANTCTGAAGTTGCTATATC 180

Qy 2543 TGAGGAGTTGGTTCAGAAGTACA 2565

I I I I I I I I I I I II I II I I I I I I 
Db 181 TGAGGAGTTGNTTCAGAAGTACA 203



RESULT 11 
US-09-439-313-279 

; Sequence 279, Application US/09439313 
; Patent No. 6329505 
; GENERAL INFORMATION: 

APPLICANT: Xu, Jiangchun 
; APPLICANT: Dillon, Davin C. 
; APPLICANT: Mitcham, Jennifer L. 
; APPLICANT: Harlocker, Susan Louise 



APPLICANT: Jiang, Yuqui
APPLICANT: Reed, Steven G. 
APPLICANT: Kalos, Michael 
APPLICANT: Fanger, Gary 
APPLICANT: Retter, Mark 
APPLICANT: Solk, John 
APPLICANT: Day, Craig 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THERAPY AND 
TITLE OF INVENTION: DIAGNOSIS OF PROSTATE CANCER 
FILE REFERENCE: 210121.427C9
CURRENT APPLICATION NUMBER: US/09/439,313
CURRENT FILING DATE: 1999-11-12
NUMBER OF SEQ ID NOS: 575

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 279 
LENGTH: 301 
TYPE: DNA 

ORGANISM: Homo sapien 
FEATURE: 

NAME/KEY: misc_feature 
LOCATION: (1)...(301)
OTHER INFORMATION: n = A,T,C or G 
US-09-439-313-279 

Query Match 3.6%; Score 138.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.3e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 

Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II MINI II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 AAAGCAGGAATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59 

Qy 3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTC 3433

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I III I I I I I I I I I II 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA------TATGTATAGTGTTTC 113

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT------TCTGT 3487

I I I I I I I I 1 I I I I I I I I I I I I II I I I I I I I I I I II II I I I I I I MM 
Db 114 ACAAAGCTTAGACCTTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172

Qy 3488 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 3546

I I I I I I I I I I M I II I I II I I I I I I M II I I I I I I II II I I I II I I 

Db 173 CATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAANAANAAATATTTCTAGGG 232

Qy 3547 CGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCAACATAA 3606

I I II M I M I I II I I I I I I I I I I M I I 1 I I II I I I I I I M I I 

Db 233 AGCACTACCATCTGTTTTCA----CATGAAATGCCACACACATAGAACTC--CAACATCA 286

Qy 3607 ACTTCACTGCACAGA 3621 

I I II I I I I II I II 
Db 287 ATTTCATTGCACAGA 301
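
Each hit's three summary lines follow a "Name value;" layout, so they can be read mechanically. The sketch below assumes that layout holds for every hit; `parse_summary` and its regular expression are illustrative and are not part of the search program that produced this report.

```python
import re

# A field is a label (letters, spaces, periods), whitespace, a number that may
# use scientific notation, an optional percent sign, and a closing semicolon.
FIELD = re.compile(
    r"([A-Za-z][A-Za-z. ]*?)\s+(\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)%?;"
)


def parse_summary(text):
    """Return a dict mapping each summary label to its numeric value."""
    return {name.strip(): float(value) for name, value in FIELD.findall(text)}


# Summary lines copied from Result 11.
summary = """Query Match 3.6%; Score 138.6; DB 4; Length 301;
Best Local Similarity 80.3%; Pred. No. 1.3e-25;
Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7;"""

fields = parse_summary(summary)
print(fields["Score"], fields["Pred. No."], fields["Matches"])
```

Percent signs are dropped, so "Query Match" and "Best Local Similarity" come back as plain percentages.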



RESULT 12 

US-09-352-616A-279 

; Sequence 279, Application US/09352616A 
; Patent No. 6395278 



GENERAL INFORMATION: 
APPLICANT: Dillon, Davin C. 
APPLICANT: Harlocker, Susan Louise 
APPLICANT: Jiang, Yuqui 
APPLICANT: Xu, Jiangchun 
APPLICANT: Mitcham, Jennifer Lynn 

TITLE OF INVENTION: COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS 
TITLE OF INVENTION: OF PROSTATE CANCER AND METHODS FOR THEIR USE 
FILE REFERENCE: 210121.427C8
CURRENT APPLICATION NUMBER: US/09/352,616A
CURRENT FILING DATE: 1999-07-13
NUMBER OF SEQ ID NOS: 472

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 279 
LENGTH: 301 
TYPE: DNA

ORGANISM: Homo sapien
FEATURE:

NAME/KEY: misc_feature
LOCATION: (1)...(301)
OTHER INFORMATION: n = A,T,C or G 
US-09-352-616A-279 

Query Match 3.6%; Score 138.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.3e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 

Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II I I I I I I III 1 I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 
Db 1 AAAGCAGGAATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59

Qy 3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTC 3433

II I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I III I I I I I I I I I I I 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA------TATGTATAGTGTTTC 113

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT------TCTGT 3487

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II li 

Db 114 ACAAAGCTTAGACCTTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172

Qy 3488 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 3546

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I 

Db 173 CATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAANAANAAATATTTCTAGGG 232

Qy 3547 CGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCAACATAA 3606

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 233 AGCACTACCATCTGTTTTCA----CATGAAATGCCACACACATAGAACTC--CAACATCA 286

Qy 3607 ACTTCACTGCACAGA 3621

I I I I I II I I I I I I 
Db 287 ATTTCATTGCACAGA 301



RESULT 13 

US-09-232-149A-279 

; Sequence 279, Application US/09232149A 
; Patent No. 6465611 
; GENERAL INFORMATION:
; APPLICANT: Xu, Jiangchun
; APPLICANT: Dillon, Davin C.
; APPLICANT: Mitcham, Jennifer Lynn
; TITLE OF INVENTION: COMPOUNDS FOR IMMUNOTHERAPY OF PROSTATE
; TITLE OF INVENTION: CANCER AND METHODS FOR THEIR USE
; FILE REFERENCE: 210121.427C6
; CURRENT APPLICATION NUMBER: US/09/232,149A
; CURRENT FILING DATE: 1999-01-15
; NUMBER OF SEQ ID NOS: 338
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 279
; LENGTH: 301
; TYPE: DNA
; ORGANISM: Homo sapien
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: (1)...(301)
; OTHER INFORMATION: n = A,T,C or G
US-09-232-149A-279 

Query Match 3.6%; Score 138.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.3e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 

Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II I II I I 1 III I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1 AAAGCAGGAATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59

Qy 3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTC 3433

I I I I I I I i I I I I I I I I I I II I I II I I I I I I I II I I I I I III I I I 1 I I I I I I I 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA------TATGTATAGTGTTTC 113

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT------TCTGT 3487

I 1 M I I I I II I I I I I I I M I I I I I I I II I I I I I I I I I I I I II I II I I 

Db 114 ACAAAGCTTAGACCTTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172

Qy 3488 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 3546

I I I I I I I I 1 I I I I I I M I I I I I I I I II I I M II I I II I I I I I I I I I 

Db 173 CATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAANAANAAATATTTCTAGGG 232

Qy 3547 CGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCAACATAA 3606

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 233 AGCACTACCATCTGTTTTCA----CATGAAATGCCACACACATAGAACTC--CAACATCA 286

Qy 3607 ACTTCACTGCACAGA 3621 

I I I I I I I I I I I I I 
Db 287 ATTTCATTGCACAGA 301



RESULT 14 
US-09-159-812-279 

; Sequence 279, Application US/09159812A 

; Patent No. 6613872 

; GENERAL INFORMATION: 

; APPLICANT: Xu, Jiangchun 

; APPLICANT: Dillon, Davin C. 

; TITLE OF INVENTION: COMPOUNDS FOR IMMUNOTHERAPY OF
TITLE OF INVENTION: PROSTATE CANCER AND METHODS FOR THEIR USE
FILE REFERENCE: 210121.428C5

CURRENT APPLICATION NUMBER: US/09/159,812A
CURRENT FILING DATE: 1998-09-23
NUMBER OF SEQ ID NOS: 306

SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 279
LENGTH: 301
TYPE: DNA

ORGANISM: Homo sapien
FEATURE:

NAME/KEY: misc_feature
LOCATION: (1)...(301)
OTHER INFORMATION: n = A,T,C or G
US-09-159-812-279 

Query Match 3.6%; Score 138.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.3e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 

Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II I I I I I I III I I I I I 1 I I I I I I I I I I II I I I I M I I I II I I I I I I I I I I I I I 
Db 1 AAAGCAGGAATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59 

Qy 3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTC 3433

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA------TATGTATAGTGTTTC 113

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT------TCTGT 3487

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

Db 114 ACAAAGCTTAGACCTTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172

Qy 3488 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 3546

I I I II I I I I I I I I I I I I II I I I I I I I I I I I M I I I II I I I I I I I I I 

Db 173 CATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAANAANAAATATTTCTAGGG 232

Qy 3547 CGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCAACATAA 3606

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II I I I I I I I 

Db 233 AGCACTACCATCTGTTTTCA----CATGAAATGCCACACACATAGAACTC--CAACATCA 286

Qy 3607 ACTTCACTGCACAGA 3621 

I I I I I I I II I I I I 
Db 287 ATTTCATTGCACAGA 301



RESULT 15 
US-09-636-215-279 

Sequence 279, Application US/09636215 
Patent No. 6620922 
GENERAL INFORMATION: 
APPLICANT: Xu, Jiangchun 
APPLICANT: Dillon, Davin C. 
APPLICANT: Mitcham, Jennifer L. 
APPLICANT: Harlocker, Susan L. 
APPLICANT: Jiang, Yuqui 
APPLICANT: Henderson, Robert A. 
APPLICANT: Kalos, Michael D. 



APPLICANT: Fanger, Gary R. 
APPLICANT: Retter, Marc W. 
APPLICANT: Stolk, John A. 
APPLICANT: Day, Craig H. 
APPLICANT: Vedvick, Thomas S. 
APPLICANT: Carter, Darrick 
APPLICANT: Li, Samuel
APPLICANT: Wang, Aijun 
APPLICANT: Skeiky, Yasir A.W. 
APPLICANT: Hepler, William 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY AND 
TITLE OF INVENTION: DIAGNOSIS OF PROSTATE CANCER 
FILE REFERENCE: 210121.42717C17
CURRENT APPLICATION NUMBER: US/09/636,215
CURRENT FILING DATE: 2000-08-10
NUMBER OF SEQ ID NOS: 852

SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 279
LENGTH: 301
TYPE: DNA

ORGANISM: Homo sapien
FEATURE:

NAME/KEY: misc_feature
LOCATION: (1)...(301)
OTHER INFORMATION: n = A,T,C or G 
US-09-636-215-279 

Query Match 3.6%; Score 138.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.3e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 

Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II I I II I I III I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1 AAAGCAGGAATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59

Qy 3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTC 3433 

I I I I I I I I I I I I I I I I I I II I M I I M I I I I I I I I I I I III I I I I I I II I I I 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA------TATGTATAGTGTTTC 113

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT------TCTGT 3487

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I II II 

Db 114 ACAAAGCTTAGACCTTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172

Qy 3488 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 3546 

III I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I II 

Db 173 CATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAANAANAAATATTTCTAGGG 232

Qy 3547 CGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCAACATAA 3606

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 233 AGCACTACCATCTGTTTTCA----CATGAAATGCCACACACATAGAACTC--CAACATCA 286

Qy 3607 ACTTCACTGCACAGA 3621

I I I I I I I I I I I I I 
Db 287 ATTTCATTGCACAGA 301



Search completed: September 11, 2004, 16:11:44 



Job time : 240.854 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on: September 11, 2004, 01:22:50 ; Search time 1630.06 Seconds
(without alignments)
11831.342 Million cell updates/sec



Title: US-09-830-972-28 
Perfect score: 3833 

Sequence: 1 ctatctcctctctcagccgc......ttaaaaaaaaaaaaaaaaaa 3833



Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0



Searched: 3304383 seqs, 2515761380 residues 

Total number of hits satisfying chosen parameters: 6608766 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_NA:*

1: /cgn2_6/ptodata/1/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/1/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/1/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/1/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/1/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/1/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/1/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/1/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/1/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/1/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/1/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq2:*
14: /cgn2_6/ptodata/1/pubpna/US10A_PUBCOMB.seq:*
15: /cgn2_6/ptodata/1/pubpna/US10B_PUBCOMB.seq:*
16: /cgn2_6/ptodata/1/pubpna/US10C_PUBCOMB.seq:*
17: /cgn2_6/ptodata/1/pubpna/US10_NEW_PUB.seq:*
18: /cgn2_6/ptodata/1/pubpna/US60_NEW_PUB.seq:*
19: /cgn2_6/ptodata/1/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
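
Note on the summary statistics: the per-result figures in this report appear to be related by simple ratios. The sketch below is an assumption inferred from the numbers printed here (for example Score 138.6 against Perfect score 3833), not a formula taken from GenCore documentation; the function names are hypothetical. Pred. No. cannot be recomputed this way, since it depends on the full score distribution.

```python
# Sketch only: these relationships are inferred from the figures printed in
# this report, not taken from GenCore documentation.

def query_match_percent(score: float, perfect_score: float) -> float:
    """Alignment score as a percentage of the query's perfect score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches: int, mismatches: int, indels: int) -> float:
    """Matches as a percentage of all aligned columns."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


if __name__ == "__main__":
    perfect_score = 3833  # "Perfect score" from the run header
    # Figures from the hits against SEQ ID NO 279 (Score 138.6, 253/41/21).
    print(round(query_match_percent(138.6, perfect_score), 1))    # 3.6
    print(round(best_local_similarity_percent(253, 41, 21), 1))   # 80.3
```

Under these assumptions the same two functions also reproduce the 69.1% and 85.0% figures reported for RESULT 1 (Score 2649.8; Matches 3292, Mismatches 442, Indels 137).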
ALIGNMENTS



RESULT 1 

US-10-060-036-53 

; Sequence 53, Application US/10060036 
; Publication No. US20030073144A1 
; GENERAL INFORMATION: 



APPLICANT: Benson, Darin R. 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes, Michael J. 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/10/060,036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS: 4560

SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 53
LENGTH: 4632
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-060-036-53 



Query Match 69.1%; Score 2649.8; DB 15; Length 4632;

Best Local Similarity 85.0%; Pred. No. 0;

Matches 3292; Conservative 0; Mismatches 442; Indels 137; Gaps 24;



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I II I 1 I I I I II II
Db 853 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 912

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120

Ml I I I I I I I I I I I I I I I I I II I II II I I I I I II I I I I I I I I I I I I I I
Db 913 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 972

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180

I I I I I I I I I I I I I II I I I I I I III I I I I I I I I I I I I I I I I I I I I I Mill
Db 973 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1032

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240

I I II I I I I I I I I I I I I I II I I II I II I I III I I I I I I II I I I I I I I I I I I M
Db 1033 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1092

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I
Db 1093 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1152

Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360

II I I I I I I I I I I I II I II I I I I I I II I I I I Mill II II I M I I I I II
Db 1153 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1212

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420

II I I I I Mill I I II I I I I I II I I I I I I I M I II I I II I I I I I I I I II I
Db 1213 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1269

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480

II I I I I Mill II II I II I I I I I M I I I I I II I I I II II I II II I I I II I I I M I I I I
Db 1270 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1329

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540

II I II II I III I II I I I I II I I I III I II I M I II II I II I II I M II
Db 1330 AAAGATAGT---AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 1386



Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600

I I I I I I II I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I III
Db 1387 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 1446

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660

I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III
Db 1447 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1506

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC---CAACAACTGAGAATGTT 717

I I M I I I I I I I II I I I I I I I I I I I II I I I I I I I I M II I II I I I I II
Db 1507 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1566

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 1567 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1626

Qy 778 AAGATAGAA-AAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATCA 836

II I I I I II I I I I I I I I I I I M I I I I I I II M I I I I I I II Ml I I I I I I
Db 1627 AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 1686

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896

II I I I I I I III I I I I I I I I I I I I I I I I I I II I Ml M I
Db 1687 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1746

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956

I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I M II I II I I I I I II
Db 1747 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1806

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016

I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I
Db 1807 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1866

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I Mill
Db 1867 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1926

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136

I I I I I I I I I I I I I I I Mill I II M I I II II I M II M II II
Db 1927 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1986

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196

I I II I II II II I II II II II I I I II I II I II I I I II II M M I I MM III
Db 1987 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2046

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256

I I I I I II II II II II I I II II I I I I I I I I II I I M II I II I I I II I I I I II I
Db 2047 TCATCACCATTAGAAGCTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2103

Qy 1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316

II I I I I M II I II II I I II I I I I M I I II I I I I I I I I I I I I II I I I M
Db 2104 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2163

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376

I I I I I I I I I I I II II I II I I II I I II I II I MM I I II II II II I II M II I
Db 2164 GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 2223



Qy 1377 AT AT C TAT T G CAT GT GAT T T AAT T AAAGAAACAAAGAT CT C TAG T GAAC C GACT C CAGAT 1436 

I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I III II I I I I I II I I II I 
Db 2224 AT AT C TAT T GC AT GT GAT T T AAT TAAAGAAAC AAAG CT T T CT G C T GAAC CAGC T C C GG AT 2283 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 1496 

I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 22 8 4 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 234 3 

Qy 1497 GT T GAAGATT C CT C C CC C GAT T CT GAAC C AGT T GACT TAT T T AGT GAT GAT T CAAT AC C C 1556 

I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 

Db 234 4 GTT GAAGAT T C CT CAC C T GAT T CT GAAC C AGT T GACT TAT T T AGT GAT GAT T CAAT AC C T 2403 

Qy 1557 GAAGT T C C ACAAAAACAAGAT GAAG CT GT AAT AC T T GT GAAAGAAAAC CT CAC T GAAAT T 1616 

II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 2404 GAC GT T C CACAAAAACAAGAT GAAACT GT GAT GC T T GT GAAAGAAAGT CT CAC T GAGAC T 24 63 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT C AGTGCTT CACCAT CAC CT 1676 

I I I I I I I I I I I I I I I II I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 2464 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAACT C AGT GCT T T G C CACCT 2520 

Qy 1677 GAGG GAG GAAAAC C GT AT T T GGAGT CT T T T C AG C C CAGT T T AGGC AT C ACAAAAGAT AC C 17 36 

I I I I I I I I I I I II I M I I I I I MINI III II I I I I I I I I I II I I I I I I I II 
Db 2521 GAGG GAGGAAAG C CAT AT T T GGAAT CT T T T AAGCT CAGT T TAGATAAC ACAAAAGAT AC C 2 58 0 

Qy 1737 T TAG C AC CT GAT GAAGT T T CAG CAT T GAC C CAAAAG GAGAAAAT C C CT T T GC AGAT G GAG 1796 

I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I 

Db 2581 C T GT T AC C T GAT GAAGT T T CAACAT T GAG C AAAAAGGAGAAAAT T C C T T T GC AGAT G GAG 264 0 

Qy 1797 GAGC T CAAT AC T G CAGT T TAT T CAAGT GAT GGCT TAT T CAT T G CT CAG GAAGC AAAC CT A 1356 

I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I III II I I I I I I II I II 
Db 2641 GAGC T CAGT ACT G CAGT T TAT T CAAAT GAT GAC T TAT T TAT T T C TAAG GAAGCAC AGAT A 27 0 0 

Qy 1857 AGAGAAAGTGAAACATTTT CAGATT CAT CT CCGATT GAGATTAT AGAT GAGTT CCCGAC C 1916 

I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2701 AGAGAAACTGAAACGTTTT CAGATT CAT CT CCAATT GAAATTATAGAT GAGTT CCCTACA 27 60 

Qy 1917 T TT GT C AGT T CT AAAG CAGAT TCTTCTCC T AC AT TAG C CAG G GAAT AC AC T GAC CT AGAA 197 6 

II I I I I I I I I I I I I Mill I I III I II I I I II I I I I M I I II I II I I I I I I 

Db 2761 T T GAT C AGTT CTAAAACT GAT T CAT T T T C T AAAT TAG C C AGG GAAT AT AC T GAC C T AGAA 2820 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

III II I I I II II I II II II I II I I M I M I I I II II II I I I I I II I I I I M 

Db 2 821 GTATCCCACAAAAGT GAAAT TGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 28 80 

Qy 2037 GGAT T G C C C CAT GAC CT T T C T T T CAAG AGT AT AC AAC C TAAAGAG GAAGT T CAT 2090 

I I I I I II I I I II II I II II I I I I I II M I I I I II I I I II I I I I I 

Db 2881 GAAT T GC C CC AT GAC CT T T C T T T GAAGAAC AT ACAAC C CAAAGT T GAAGAGAAAAT CAGT 294 0 

Qy 2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I I II I I I II M I I I M III II I I II II I I I II I I I I M I I 

Db 2941 T T CT CAGAT GAC T T T T CTAAAAAT GG GT CT G C T ACAT CAAAG GT GCT C T TAT T GC CT C C A 3000 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG C AGAGAT AGGCAG CAT AGAAAAAC C CAAAGT T C T T 2210 

II I I I II I I I I I II II I I I I I II I M I I I I I I I I I I I I I I I II I I I I II M I I 

Db 3001 GAT GTT TCTGCTTTGGCCACTCAAGC AGAGAT AGAGAGCAT AGTT AAAC CCAAAGTTCTT 3060 



Qy 2211 GT GAAAGAAG C C GAGAGAAAACT T C CT T C T GAT ACAGAAAAAGAG C GAAGAT C T C CAT CT 2270 

I I I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I II I II I II I I I I I I I I I I I I I 
Db 3061 GT GAAAGAAG CT GAGAAAAAACT T C CT T C C GAT ACAGAAAAAGAG G AC AGAT C AC CAT CT 312 0 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
Db 3121 GCT AT ATT T T C AGC AGAG CT GAGT AAAACT T C AGT T GT T GAC C T C C T GT ACT G G AGAGAC 3180 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 3181 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3240 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

II I I I II I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3241 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3300 

Qy 2 451 TTTAGGAT AT ATAAGGGTGT GAT CCAGGCT AT CCAGAAAT CT GAT GAAGGC CACC CAT T C 2510 

I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I 
Db 3301 TTTAGGAT AT ACAAGGGT GT GAT CCAAGC TAT CCAGAAAT CAGAT GAAGGC CACC CAT T C 3360 

Qy 2511 AGGGC AT AT T T GGAAT CT GAAGT T G CT AT AT CT GAGGAGT T GGT T C AGAAGT AC AGCAAT 257 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I III 
Db 3361 AGG GC AT AT CT GGAAT CT GAAGT T GCT AT AT CT GAGGAGT T GGT T C AGAAGT AC AGT AAT 3420 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I II I I I II I I II I I I I I I II I I Mill I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3421 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 34 80 

Qy 2631 GAT T T AGT T GAT T C T CT GAAGT T T GC AGT GT T GAT GT G GGT AT T T AC C TAT GT T GGT G C C 2 690 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I 
Db 3481 GAT TTAGT T GAT T CT C T GAAGT TTGCAGTGTT GAT GTGGGTATTTACCTATGTT GGT GCC 3540 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I 
Db 3541 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3600 

Qy 2751 ATT TAT GAAC GG C AT C AGGC GCAAAT AGAT CAT T AT CT GGGACT T GCAAAT AAGAAT GT T 2 810 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I 
Db 3601 ATT TAT GAACG GC AT C AGG C AC AGAT AGAT CAT T AT CT AGGACT T GCAAAT AAGAAT GT T 3660 

Qy 2811 AAAGAT G C TAT GG CTAAAAT C C AAGCAAAAAT C C C T GGAT T GAAGC GTAAAG CT G AAT GA 2 870 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 3661 AAAGAT G CT AT G GCTAAAAT C CAAG CAAAAAT C C C T G GAT T GAAG C G C AAAG CT G AAT GA 3720 

Qy 2 871 GAAAGC CT GAAAGAGTTAAC AAT AGAG GAGT T TAT CTTTAAAGGGGAT AT T CATTT GATT 2 930 

Mill I I I I I I I I I II II I II I I I I I I I II I I I I I II I II I I I I II 

Db 3721 AAACGCCCAAAATAATTAGT AGGAGTT CAT CTTTAAAGGGGAT ATT CATTT GATT 3775 

Qy 2931 C CAT T G GGGAGG GT C AG G GAAG AAC AAAG C CT T GACAT T G C AGT G C AGTT T CACAGAT CT 2 990 

II I I I I I 

Db 3776 AT ACGGATCT 3785 

Qy 2 991 TTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCA 304 9 

I I I I I I I II I I I Ml II I II I I II I II I I I I I I I I II I I II I I M I M II I I I 
Db 3786 TTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCA 3845 



Qy 3050 TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTC 3109 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 II 1 1 1 i I 1 1 1 1 1 1 

Db 3846 TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATA--TCTTTTTC 3903 

Qy 3110 C T GT AT GAG G C ACT GGT GAAT AAACAAAGAT C T GAGAAAGC T GT AT AT T AC AC T T T GT C G 3169 

I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 3904 CTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTTGTTG 3952 

Qy 317 0 CAGGTAGT CTTGCT GTAT - TT GGGGAATTGCAAAGAAAGT GGAGCT GACAGAAA 3222 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 3953 CAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAAA 4 012 

Qy 322 3 TAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTG 3282 

II I I I I I I I I I II I I I II I I I I I I I II I I II I I I I I I I I I II I I I I I I I I 
Db 4013 AAAAGCCCTTTTCAGTTTGTGCACTGT GTAT GGTCCGTGTAGATT GAT GCAGATTTTCTG 4072 

Qy 3283 AAATGAAA T GT T T AGAC GAGAT CAT GCC AC CAAG GC AG GAGT GAAAAAG C T T G C CT 3338 

I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I II I I I I I I I I I II 

Db 4073 AAAT GAAAT GT T T GT T T AGAC GAGAT CAT AC C GGTAAAG C AGGAAT GAC AAAGC T T G - C T 4131 

Qy 3339 TTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGT 3398 

II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 4132 TTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGT 4191 

Qy 33 9 9 AAAT AT AGAT TAT AT AT AT CT AT AT AT AGT GT T T CAC GAAGC T T AGC C CT T T AC C T T C C C 345 8 

I I I I II I I i II I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I II II 

Db 4192 AAAT AT AGAT TAT AT AT G T ATAGT GT T T C ACAAAG CT T AGAC C T T TAC C T T - C C 4244 

Qy 3459 AGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGTGTGTAGTCCCA 3512 

III I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4245 AGC CAC C C CAC AGT GCT T GAT AT T T C AGAGT C AGT CAT T GGT TAT AC AT GT GT AGT T C C A 4304 

Qy 3513 AAGC AC AT AAGC T AGGGAGAAAC GT ACT T CT AG G C GCAC TAC CAT C T GTT T T C AACAC GA 3572 

I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I II I I II I II 

Db 4305 AAGCACATT^GCTAGAAGAAGAAATATTTCTAGGAGCACTACCATCTGTTTTCAACATGA 4 364 

Qy 3573 AC C GAC GC CAT GCAAAC AGAACT C C - T C AAC AT AAAC T T CAC T GC ACAGAC TT ACT GT AG 3631 

I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I i I I I I I I I I I 
Db 4365 A AT G C CAC AC AC AT AGAACT C CAAC AAC AT CAAT T T CAT T GCACAGAC T GACT GT AG 4 421 

Qy 3632 TTAATTTTATCAC — AAACTCTGGACTGAATCTAATGCTTCCAAAAA TGTTTG 3682 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I MINI 
Db 4422 TTAATTTTGTCACAGAATCTATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTG 4481 

Qy 3683 C AAAT AT C AAAC AT T GT TAT GT AAGAAAAT AT AAAT GAC GAT T T AT AC AAT T 3734 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I IN 

Db 4482 C AAAT AT C AAAC AT T GT TAT G C AAGAAAT TAT T AAT T AC AAAAT GAAG AT T TAT AC CAT T 4541 

Qy 3735 GTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGT 3794 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 4542 GTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGT 4 601 

Qy 37 95 AT CAAT AAAG CT TAT AGAC T T AAAAAAAAAA 3825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 602 ATCAATAAAGCTTATAGACTTAAAAAAAAAA 4 632 



RESULT 2 



US-09-758-140-5 

Sequence 5, Application US/09758140 
Patent No. US20020012965A1 
GENERAL INFORMATION: 
APPLICANT: Strittmatter, Stephen M.

TITLE OF INVENTION: Nogo Receptor-Mediated Blockade of Axonal Growth

FILE REFERENCE: 44574-5073-US
CURRENT APPLICATION NUMBER: US/09/758,140
CURRENT FILING DATE: 2001-01-12
PRIOR APPLICATION NUMBER: US 60/175,707
PRIOR FILING DATE: 2000-01-12
PRIOR APPLICATION NUMBER: US 60/207,366
PRIOR FILING DATE: 2000-05-26
PRIOR APPLICATION NUMBER: US 60/236,378
PRIOR FILING DATE: 2000-09-29
NUMBER OF SEQ ID NOS: 20
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 5
LENGTH: 4053
TYPE: DNA

ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: CDS
LOCATION: (135)..(3710)

OTHER INFORMATION: Human mRNA for Nogo protein (KIAA0886, GenBank
OTHER INFORMATION: Accession No. AB020693)
US-09-758-140-5 

Query Match 62.1%; Score 2379.4; DB 9; Length 4053;

Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60

11 I I I I I I I II I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I II II 
Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120

I M I I M I II I I I I I I i I I II I I II I I I I M I I I I I I I I I I I I I I I I I 
Db 906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180

I I I I I I I I I I I I I II I I I I I I Ml II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 966 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1025

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240

| M I I I I I I I I I I I I I I I I I I M IMM I M I II II I I II II II I I II M I I 
Db 1026 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1085

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300

I M II I I II II I I I I II I II II I II I I II I I I I I I II I II I I I I I I I I 
Db 1086 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1145

Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360

I || || M II I I I I I I I M M II I I M I I I I I I I I I II II I I I I I I I M 
Db 1146 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1205



Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I II I I I 

120 6 GA AGT T GTGT CTT C AGAAAAAGCAAAAGACAGTT TTAAT GAAAAGAGAGTT GCAGT G 12 62 

421 GAAG CT T CT AT G G G G GAG GAAT AT G C AGAC T T CAAAC CAT T T GAGC GAGT AT GGGAAGT G 4 80 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II M 

1263 GAAGC T C C TAT G AGG GAGGAAT AT G CAG AC T T CAAAC CAT T T GAG C GAGT AT GGGAAGT G 1322 

4 81 AAAGAT AC T T AC AAG CAAGAT AGT GAT GT T T T GAT T G CT G GAG GT AAT AT AGAGAG C AAA 54 0 
I II I I I I I III I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I I II I 

1323 AAAGATAGT AAG GAAG AT AGT GAT AT GTTGGCTGCTG GAG GT AAAAT C GAGAG C AAC 1379 

541 TT G GAAGGT AAAGT G GAT AAGAAACACT T T T C AGAT AG C C T T GAACAAACAAAT C GT GAA 600 
I I I I I I I I I I I I I I I I I II III III I I I I I I I I I I I I I I I II I I I I I III 

138 0 TT G GAAAGT AAAGT G GAT AAAAAAT GT T T T G C AGAT AG C CTT GAGCAAAC T AAT C AC GAA 143 9 

601 AAAGATAGT GAAAG C AGT AAT GAT GAC ACT T CAT T T C C C AGT AC AC C AGAAG CT GT AAGA 660 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III 

1440 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT T C T T T C C C C AGT AC G C C AGAAG GT ATAAAG 14 99 

661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACT GAGAAT GT T 717 

I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 
1500 GAT C GT T CAG GAG CAT AT AT C ACAT GTGCTCCCTT TAAC C CAGC AGCAACT GAGAG CAT T 1559 

718 T CAAC AAACAT TTTTCCCTTGTT GGAAGAT CAT AC T T CG GAAAAT AAGAC AGAT GAAAAA 777 
I I I I I I I I I I I I I I I I Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1560 GCAACAAACAT TTTTCCTTT GT T AGGAGAT C C T AC T T CAGAAAAT AAGAC C GAT GAAAAA 1619 

77 8 AAGATAGAA-AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGTGT CAAGACAT CA 836 

II I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I II Ml I I I I I I 

162 0 AAAAT AGAAGAAAAGAAG G C C CAAAT AGTAAC AG AGAAGAAT ACT AG C AC C AAAAC AT C A 1679 

837 AAC C CT T T C CT T AT GG C AG C AC AGGAGT CT AAGAC AGAT T AC GT TACAAC AGAT CAT GT G 8 96 
I I I I I I I I III I II I I I I I I I I I III I I I I I I I I I I II I I I I I I I I I II I 

1680 AAC CCTTTTCTT GT AG CAG C ACAGGAT T CT GAGAC AGAT TAT GT C ACAAC AGAT AATTT A 1739 

897 T CAAAGGT GAC C GAG GAAGT AGT GG CAAAC AT GC C T GAAGGT CT AAC C C C AGAT T T G GT T 956 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I II I II 

174 0 ACAAAGGT GACT GAG GAAGT C GT GG CAAAC AT GC C T GAAGGC CT GACT C C AGAT T T AGT A 17 99 

957 CAG GAAG CAT GT GAAAGT GAAT T GAAT GAAGC T AC T G GT ACAAAAAT T G C C T T T GAAAC A 1016 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

18 00 CAGGAAG CAT GT GAAAGT GAAT T GAAT GAAGT T AC T G GT AC AAAGAT T G CT TAT GAAAC A 1859 

1017 AAAAT GGAC C T GGT T CAAAC T T CAGAAG CT GT G C AGGAGT C ACT T T AC C CT GT AACAC AG 107 6 

II I I II I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I II I I I M I 
1860 AAAAT GGACT TGGTT CAAACAT CAGAAGTT AT GCAAGAGT CACT CTAT C CT GC AGCAC AG 1919 

1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I II I I I II I I I I II I I I I I I I I I I I I I I II I I I I I I I I II I 

192 0 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 197 9 

1137 AT GGAAGCACCATTAAATTCTGTAGTTCCT AGT GCT GGT GCTTCT GCAGT GCAGCT CAG T 119 6 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I Ml 

1980 AT GGAAGCACCATTGAATTCT GCAGT TCCTAGT GCT GGT GCT TCCGT GAT ACAGCC CAGC 2 039 



Qy 1197 T CAT CAC CAT T AGAAAC TCTTCCTT C AGT T AAT TAT GAAAGCAT AAAGT T T GAGC CT GAA 12 56 

I M II I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I 

Db 2040 T CAT CAC CAT T AGAAG C T T C T T C AGT T AAT TAT GAAAG C AT AAAAC AT GAG C C T GAA 2096 

Qy 12 57 AAT C C C C CAC CAT AT GAGGAG G C CAT GAAT GT AT CAC T AAAAAAAGAAT C AG GAAT G AAT 1316 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2097 AAC C CC C CAC CAT AT GAAGAG G C CAT GAGT GT AT CAC T AAAAAAAGT AT CAG GAAT AAAG 2156 

Qy 1317 GAAGAAAT C AC AGAGC CT GAAGGT AT T AGT GT AG C T GT T CAG GAAAC AGAAG C T C C TT AT 1376 

I I I II I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 2157 GAAGAAAT TAAAGAG C CT GAAAAT AT T AAT G CAG C T CT T CAAGAAAC AGAAG C T C C TT AT 2216 

Qy 1377 AT AT CT ATT GCAT GT GAT TT AAT T AAAG AAAC AAAG AT CT CTACT GAAC CGACT C CAGAT 1436 

I II I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I Ml 
Db 2217 AT AT CT AT T GCAT GT GAT T T AATT AAAGAAACAAAG CT T T CT G CT GAAC CAG C T C C GGAT 2276 

Qy 1437 T T CT CT AGT TAT T C AGAAAT AG CAGAAGT T G C AC AGC C AGT G C C C GAGC AT T C T GAGC T A 14 96 

I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I 
Db 2277 T T CT CT GAT TAT T C AGAAAT G G CAAAAGT T GAAC AG C C AGT G C CT GAT CAT T C T GAGC T A 2336 

Qy 1497 GT T GAAGAT TCCTCCCCC GAT T CT GAAC C AGT T GACT T AT T T AGT GAT GAT T CAATAC C C 1556 

I I I I I I I II II I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I 

Db 2337 GT T GAAGAT T C C T CAC CT GAT T CT GAAC C AGTT GACT TAT T T AGT GAT GAT T CAATAC CT 2396 

Qy 1557 GAAGTT C CACAAAAACAAGAT GAAGCT GTAAT ACT T GT GAAAGAAAAC C T CAC T GAAAT T 1616 

II I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I II I I I I I II I I I I 

Db 2397 GAC GT T C CACAAAAACAAGAT GAAACT GT GAT GCT T GT GAAAGAAAGT CT C ACT G AGACT 2 4 56 

Qy 1617 T CAT CT GAGT CAAT GAC AG GACAT GACAAT AAG G GAAAACT CAGT G CT T CAC CAT CAC CT 1676 

MM I I II I I II I I I II I I I I I II II I I I I I I I I II I I I II II I I I I II 
Db 2457 T CATTT GAGT CAATGATAGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 2513 

Qy 1677 GAG GGAGGAAAAC C GT AT T T GGAGT CT T T T CAG C C CAGT T T AGG C AT C ACAAAAGAT AC C 1736 

I I I I II I I II I I I II II I II I I I II I I II I II I I I I I I I II II I I I M I I I I 
Db 2514 GAGGGAGGAAAGCCATAT TT GGAAT CTTT TAAGCT CAGTTT AGATAACACAAAAGATAC C 2573 

Qy 1737 T TAG C AC CT GAT GAAGT T T CAG CAT T GAC CC AAAAGGAGAAAAT C C CTT T G CAGAT GGAG 17 96 

I II I I II I I II I I II I I I I II I I I II I I II I I I II II I I I I I I I I I I I II I I 
Db 2574 C T GT T AC CT GAT GAAGT T T CAAC AT T G AGCAAAAAGGAGAAAAT T C CTT T GC AGAT G GAG 2633 

Qy 1797 GAG CT CAAT ACT G C AGT T T ATT CAAGT GAT GGC T TAT T CAT T G CT C AGGAAGCAAAC CT A 1856 

I I II II I II I II II II I I I II I I I I I I I I I II I I I III II I I I I I I I I I II 

Db 2634 GAGC T CAGT ACT GC AGT T T ATT CAAAT GAT GAC T TAT T TAT T T CT AAGGAAGC AC AG AT A 2693 

Qy 1857 AGAGAAAGT GAAAC AT T T T CAGAT T CAT C T C C GAT T GAGAT TAT AGAT GAGT T C C C GAC C 1916 

II I I I II II II I I II I I I I I II I II II M I I I I I I I II I I I II I M I I I M I I I 

Db 2694 AGAGAAACT GAAAC GT T T T CAGAT T CAT CT C CAAT T GAAAT TAT AG AT GAGT T C C C T AC A 27 53 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II I I I I I I I II I I I II I II I I III I I II I I II I I II I I I I II II I I I M I I 

Db 2754 T T GAT CAGT T C T AAAACT GAT T CAT T TT C TAAAT TAG C CAG G GAAT AT ACT GAC CT AGAA 2813 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I I I II I I I I I I II I I I I I I I I M I I I II II I M II I I I I I I I I I I I M 

Db 2 814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2873 



Qy 2037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCAT 2090
Db 2874 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2933

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150
         || ||||||| || || ||| || ||  || |   |||||||||| | || ||||||||
Db 2934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2993

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210
        ||||||||||||||||   |||||||||||||||  |||||||  |||||||||||||||
Db 2994 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 3053

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270
        ||||||||||| |||| |||||||||||| |||||||||||||||   ||||| ||||||
Db 3054 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 3113

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330
        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 3114 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3173

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390
        ||||||||||||||||||||||||||||||||| | ||||||||||| ||  ||||||||
Db 3174 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3233

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450
        |||||||||||||| ||||| |||||||||||||||||||||||||||||||| ||||||
Db 3234 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3293

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510
        ||||||||||| |||||||||||||| |||||||||||||| ||||||||||||||||||
Db 3294 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3353

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570
        ||||||||| |||||||||||||||||||||||||||||||||||||||||||||| |||
Db 3354 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3413

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630
        ||||||||||||||||| |||||||| ||||| |||||||| ||||||||||||||||||
Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3474 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750
        ||||| ||||||||||||||||| ||||||||||| ||||||||||||||||||||||||
Db 3534 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3593

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810
        |||||||||||||||||||| || |||||||||||||| |||||||||||||||||||||
Db 3594 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3653

Qy 2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2870
        ||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||
Db 3654 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3713

Qy 2871 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 2930

Db 3714 AAAC GC C C AAAAT AAT T A GT AG G AGT T CAT CT T TAAAG G G GAT AT T CAT T T GAT T 37 68 

Qy 2931 C CAT T G G G GAGGGT C AGG GAAGAACAAAGC C T T GAC AT T G C AGT G CAGT T T C AC 2984 

I I I I I I I I I I M I I I I I I I I I II I I I I I ! I I I I I I I I I I I I I I I I I I 

Db 37 69 ATAC GG G G GAGGGT C AGGGAAGAAC GAA- C C T T GAC GT T G CAGT G CAGT T T C ACAGAT C G 3827 

Qy 2985 AGATCTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTG 3038 

I I I II I I I I I I I I I I I I I III II I I I I I II I II I I I I I I I I I I I I I I I I 
Db 382 8 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3887 

Qy 3039 CCCTGTGTT CAT CAT CT T AAGT AT T GT AAGC T GC T AT GT AT G GAT T T AAAT C GT AAT CAT 3098 

II I I I I I M I I I I M I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 3888 C CAT GT GT T CAT CAT CT T AAGT AT T GTAAGC T GCT AT GT AT G GAT T T AAAC C GTAAT CAT 3947 

Qy 3099 AT T T GT T T T T C CT GT AT GAGGC ACT G GT GAAT AAACAAAGAT CT GAGAAAG CT GT AT AT T 3158 

I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 3948 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC C T G TAT AT T 3994 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I II I I I I I I I I I I I M I I I I II I I I I I I I I I I III I I I I I I I I 
Db 3995 T T AC T T T GT T GCAGAT AGT CT T G C C G CAT CT T GG CAAGT T GC AGAGAT GGT G GAG CT 4 051 



RESULT 3 

US-09-972-599A-5 

; Sequence 5, Application US/09972599A
; Patent No. US20020077295A1
; GENERAL INFORMATION:
; APPLICANT: STRITTMATTER, STEPHEN M.
; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH
; FILE REFERENCE: C077 CIP US
; CURRENT APPLICATION NUMBER: US/09/972,599A
; CURRENT FILING DATE: 2001-10-06
; PRIOR APPLICATION NUMBER: PCT/US01/01041
; PRIOR FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: 09/758,140
; PRIOR FILING DATE: 2001-01-12
; PRIOR APPLICATION NUMBER: 60/236,378
; PRIOR FILING DATE: 2000-09-29
; PRIOR APPLICATION NUMBER: 60/207,366
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/175,707
; PRIOR FILING DATE: 2000-01-12
; NUMBER OF SEQ ID NOS: 57
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 5
; LENGTH: 4053
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (135)..(3710)
; OTHER INFORMATION: Human DNA encoding for Nogo protein (KIAA0886, GenBank
; OTHER INFORMATION: Accession No. AB020693)
US-09-972-599A-5 
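
The CDS feature in the header above spans bases 135 to 3710 of the 4053-base sequence. A quick consistency check follows, sketched under the assumption that the location is 1-based and inclusive (the usual convention in sequence listings, though this report does not state it). The function name is illustrative only.

```python
def cds_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive feature span (assumed convention)."""
    return end - start + 1


length = cds_length(135, 3710)
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # prints 3576 1192 0
```

A remainder of zero means the stated span is a whole number of codons, as expected for a coding region.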



Query Match 62.1%; Score 2379.4; DB 9; Length 4053;
Best Local Similarity 86.6%; Pred. No. 0;
Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14;



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGC^ 60 

II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I II II 

Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905 

Qy 61 GT ACT G C C C AC T GAAGGAAC AC T T C C AGC AACT T C AAAT GAAGC T T C T AAAGCAT T C T C A 120 

III I I I I I I I I I I I I I I I I I I II II II I I I I I I II I I I I I I I I I I I I I 

Db 906 GT AT T AC C C AC T GAAGGAAC AC T T CAAGAAAAT GT CAGT GAAG CT T CT AAAGAGGT CT C A 965 

Qy 121 GAGAAGG CAAAAAAT C CAT T T GT AGAGAGAAAT T T AACAGAAT T T T CAGAAT T G GAAT AT 18 0 

I I I I I I I I I I II I I I I I I I I I III I I I I I I I I I I I I I I I II I I I I I II I I 
Db 966 GAGAAGGC AAAAACT CT AC T CAT AGAT AGAGAT T T AACAGAGT T T T CAGAAT TAGAAT AC 1025 

Qy 181 T CAGAAAT GGAAT CAT CAT T CAGT GGCTCT CAAAAGGCAGAAC C T GC C GTAACAGTAGC G 24 0 

I I I I II I I I I I I I II I I I I I I II I I I I I III I I I I II I I I I I I I I I I I I I I I 
Db 1026 T CAGAAAT G GGAT CAT C GT T CAGT GT C T C T C CAAAAGCAGAAT CT GC C GT AAT AGT AG C A 10 8 5 

Qy 241 AAT C CT AGG GACGAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAGGACT TAGT T AGT C T T 300 

I I I I I I II I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 108 6 AAT CCT AGGGAAGAAATAAT CGT GAAAAAT AAAGAT GAAGAAGAGAAGTTAGTTAGTAAT 1145 

Qy 301 AAC AT C C T T CAT ACT CAG CAGGAGT T AT CT AC AGT C CT TAC GAAAT C AGT T GAAGAAGAA 360 

I I I I I I I I I I I II III II I I I I I I I I I I I I I I I I I I I I I III I I I I II 

Db 1146 AAC AT C C T T CAT AAT CAACAAGAGT T ACCT AC AGCT CT T AC T AAAT T G GT T AAAG AGGAT 1205 

Qy 361 GAT AGAGT T C T GT CT C C AGAAAAAACAAAGGAC AGTT T T AAGGAAAAGGGAGT T GC AG C A 42 0 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 12 06 GA AGT T GT GT CT T C AGAAAAAGCAAAAGAC AGT T T TAAT GAAAAGAGAGT T GC AGT G 12 62 

Qy 421 GAAG C T T C TAT GGGGGAGGAAT AT GC AGAC T T CAAAC CAT T T GAGC GAGT AT G G GAAGT G 4 80 

I I I I I I I I I II I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1263 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1322 

Qy 4 81 AAAGAT AC T T ACAAGCAAGAT AGT GAT GT T T T GAT T GCT G GAGGTAATAT AGAGAGCAAA 54 0 

I I I I I I I I III I I I I I I I I I I I I III I I II I II I I I I I II I II I I I I I 
Db 1323 AAAGATAGT AAGGAAGAT AGT GAT AT GTT GGCT GCTGGAGGTAAAAT C GAGAGCAAC 137 9 

Qy 541 T T GGAAGGT AAAGT GGAT AAGAAAC ACT T T T C AGAT AG CCT T GAACAAAC AAAT CGT GAA 600 

I I I I I I I I I I I I I I I I I I I III III I I I I I I I I II I I I I I I I I I I II III 
Db 138 0 T T GGAAAGT AAAGT GGAT AAAAAAT GT T T T G C AGAT AG CCT T GAGCAAAC TAAT C AC GAA 14 39 

Qy 601 AAAGATAGT GAAAGCAGT AAT GAT GAC ACT T CAT T T C C CAGT ACAC C AGAAG C T GT AAGA 660 

I I I I I II I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I III 
Db 144 0 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT T CT T T C C C CAGT AC GC C AGAAGGT AT AAAG 14 99 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AACAACT GAGAAT GTT 717 

I I I I I I I I II I II I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I II 
Db 1500 GATC GTT CAGGAGCATAT AT CAC AT GT GCT C CCTTTAACC CAGCAGCAACT GAGAGCATT 1559 

Qy 718 T C AAC AAAC AT TTTTCCCTTGTTG GAAG AT CAT ACT T C G GAAAAT AAG AC AG AT G AAAAA 777 

I I I I I I I II I I I I I I I I I I II I I I I I I MINI I I I II I I I I II I I I I I I I I I 
Db 1560 G CAACAAAC AT TT T T C CT T T GT TAG GAG AT C CT ACT T CAGAAAATAAGAC C GAT GAAAAA 1619 

Qy 778 AAG AT AGAA- AAAAAAAG GC ACAAAT T GT AAC AGAGAAGAAT GCAAGT GT C AAGAC AT C A 836 



162 0 AAAAT AGAAGAAAAGAAG G C C CAAAT AGT AAC AGAGAAGAAT ACT AG C AC CAAAACAT C A 167 9 

8 37 AAC C C T T T C CT T AT G GC AG C AC AG G AGT CT AAG AC AGAT T AC GT T AC AAC AGAT CAT GT G 896 

I M I I I I I II I I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I I I I II I 
1680 AAC CCTTTTCTT GT AG CAG CAC AG GAT T CT GAGAC AGAT TAT GT C ACAAC AGAT AAT TT A 173 9 

8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
174 0 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1799 

957 CAG GAAGC AT GT GAAAGT GAAT T GAAT GAAG CT ACT G GT ACAAAAAT T G C C T TT GAAAC A 1016 
I I I I I I I I M I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
18 00 CAG GAAGC AT GT GAAAGT GAAT T GAAT GAAGT T ACT G GT ACAAAGAT T G C T TAT GAAAC A 185 9 

1017 AAAAT G GAC C T GGT T CAAAC T T CAGAAG CT GT G C AGGAGT C ACT T T ACC CT GTAAC AC AG 1076 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
1860 AAAAT G GACT T GGT T CAAACAT C AGAAGT T AT G CAAGAGT C ACT C TAT C CT G C AGCAC AG 1919 

1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
1920 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 197 9 

1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I III 
1980 AT GGAAGC AC CAT T GAAT T CT GC AGT T CC T AGT G CT G GT G CT T C C GT GAT ACAG C C CAG C 2039 

1197 T CAT CAC C ATT AGAAACT C T T C CT T CAGT T AATT AT GAAAG C AT AAAGT T T GAGC CT GAA 1256 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

2 04 0 T CAT CAC CAT T AGAAG C T T CT T CAGT T AAT TAT GAAAG C AT AAAAC AT GAGC C T GAA 2 096 

1257 AAT C C C C CAC CAT AT GAG GAG G C CAT GAAT G TAT CAC T AAAAAAAGAAT CAG GAAT GAAT 1316 

II I I I I II I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I 1 

2 097 AAC C C CCCAC CAT AT GAAGAGGCCAT GAGT GTAT CACT AAAAAAAGT AT CAGGAAT AAAG 2156 

1317 GAAGAAATCACAGAGCCT GAAGGTATT AGT GTAGCT GTT CAGGAAACAGAAGCT C CTT AT 137 6 

I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
2157 GAAGAAAT TAAAGAGC CT GAAAATAT T AAT G C AG CT CTT CAAGAAAC AGAAGCT C CT T AT 2216 

1377 AT AT C TAT T GC AT GT GAT T T AAT T AAAGAAACAAAGAT CT CT AC T GAAC C GACT C CAGAT 1436 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I Ml 
2217 AT AT C TAT T GC AT GT GAT T T AAT T AAAGAAAC AAAG CTTTCTGCT GAAC CAG C T C C G GAT 227 6 

1437 T T CT C T AGT T ATT CAGAAATAGC AGAAGT T G CAC AG C CAGT GCCC GAGC AT T C T GAGCT A 14 96 

I I I I I I I I II I I I I I I I I III I I I I I I II I I I II I I I I I II I I I I I I I M I I I 
2277 T T CT CT GAT TAT T CAGAAAT GG C AAAAGT T GAAC AG C CAGT G C C T GAT CAT T C T GAGCT A 2336 

1497 GTT GAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGT GAT GATTCAAT AC CC 1556 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

2337 GT T GAAGAT T C C T CAC C T GAT T C T GAAC CAGT T GACT TAT T T AGT GAT GAT T CAATAC CT 2396 

1557 GAAGT T C CACAAAAAC AAGAT GAAG CT GT AAT AC T T GT GAAAGAAAAC CT CAC T G AAAT T 1616 

II I I I I I I I I I I I I I I I I I I I I I MM II II I I II II I I I II II I I I I II I I 
2397 GAC GT T C C AC AAAAACAAGAT GAAACT GT GAT G C T T GT GAAAGAAAGT CT CAC T GAGAC T 2456 

1617 T CAT CT GAGT CAAT GACAGGAC AT GAC AAT AAGG GAAAACT CAGT GC T T CAC CAT CAC CT 167 6 
MM I I II I I II I II II I M II II I I I I I I I II II II II I I II I Mill 



Db 2457 T CATT T GAGT CAAT GATAGAATAT GAAAATAAGGAAAAACT CAGT GCT TT GC CACCT 2513



Qy 1677 GAG G GAG G AAAAC C GT AT T T GG AG T C T T T T C AG C C C AG T T TAG G CAT C AC AAAAG AT AC C 1736 

I I I I I I I I I I I II I I I I I II I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 
Db 2514 GAG G GAGGAAAG C CAT AT T T GGAAT CT TT T AAG C T CAGT T T AGAT AAC ACAAAAG AT AC C 2573 

Qy 1737 T T AGC AC C T GAT GAAGT T T C AG CAT T G AC C CAAAAG GAG AAAAT C C CT T T GC AGAT GG AG 1796 

I I II I I I I I I I II I I I I MINI I I I I I II I I I I I I I I I I I I I I I I I II I I I 
Db 2574 CT GT T AC C T GAT GAAGT T T CAAC AT T GAG CAAAAAG GAGAAAAT T C CT T T G C AGAT GGAG 2633 

Qy 1797 GAG CT CAAT ACT GC AGT T TAT T CAAGT GAT G G C T TAT T CAT T GC T C AGGAAG CAAAC CT A 1856 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I II 
Db 2 634 GAG C T CAGT ACT G C AGT T TAT T CAAAT GAT GACT TAT T TAT T T CT AAG GAAGC AC AGAT A 2 693 

Qy 1857 AGAGAAAGT GAAACATTTT CAGATT CATCT C CGATT GAGATT ATAGAT GAGTT C C CGACC 1916 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I II 

Db 2 694 AG AGAAAC T GAAAC GT T T T C AGAT T CAT C T C CAAT T GAAAT TAT AG AT GAGT T C C C T AC A 2753 

Qy 1917 T TT GT CAGTT CTAAAG C AGAT T CT T CT C CT AC AT TAGC C AG GGAAT ACAC T GAC CT AGAA 1976 

II I I II I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2754 T T GAT CAGTT CTAAAACT GAT T CAT T T T CTAAAT TAG C C AGGGAAT AT ACT GACC T AGAA 2813 

Qy 1977 GT AG C C CACAAAAGT GAAAT T GCT GAC AT C C AG GAT G GAGC T GG GT CAT T GG C TT GT GC A 2036 

III I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II MM M 

Db 2814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2873 

Qy 2037 GGAT T G C C C CAT GAC CT T T C T T T CAAGAGT AT ACAAC C T AAAG AG GAAG T T CAT 2090 

I II I M II I I I I I I I I I I I I II I I I I II I I I I I I I I I II I II I I 

Db 2 874 GAAT T G C C CC AT GAC CT T T CT T T GAAGAAC AT ACAAC C C AAAGT T GAAGAGAAAAT CAGT 2 933 

Qy 2091 GT C C C AGAT GAGT T CT C CAAAGAT AGG G GT GAT GTT T CAAAGGT GC C C GT ACT GC C T C C A 2150 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I II I I I I I 
Db 2934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2993 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG C AGAGAT AG GCAGC AT AGAAAAAC C CAAAGT T C T T 2210 

I I II I I I II I II I II I II I I II I II II II I I I I II II I II II II II I I M I M 
Db 2 994 GAT GTTTCTGCTTT GGC C AC T CAAGC AGAGAT AGAGAGC AT AGT T AAAC C CAAAGT T CT T 3053 

Qy 2211 GT GAAAGAAG C C GAGAGAAAACTT C CT T CT G AT ACAGAAAAAGAGC GAAGAT C T C CAT C T 2270 

I II II I I I I I I I I I I I I I I II I I I I I I II I I II II I I II II I II II I I I I I I I 
Db 3054 GT GAAAGAAGCT GAGAAAAAACTT CCTT C C GATACAGAAAAAGAGGACAGAT CAC CAT CT 3113 

Qy 2271 GCT AT AT T TT CAGCAGAGCT GAGTAAAACTT CAGTT GT T GACCT C CT CTACT GGAGAGAC 2330 

I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I II II I I I II II II I I II II I I II I II 
Db 3114 G CT AT AT T T T CAGCAGAGCT GAGT AAAAC T T C AGT T GT T GACC T C C T GT AC T GGAGAGAC 3173 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I II I I I I I II II I I I I I II I I I I II I I I I I II I I I I I I M I I I I I II I II I I I II 
Db 3174 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3233 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 50 

I I II I I I II II I I I I I I I I I I II II I II II I I I I I II I I I II II II II I II I M I II 
Db 3234 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 32 93 

Qy 2451 T T TAG GAT AT AT AAG GGT GT GAT C CAG GC T AT C CAGAAAT CT GAT GAAGG C CAC C CAT T C 2510 

I M I I M I I II II I I I I I I I II I II I I I I I I I I I I I I I I II I I II I II I I II I I M I 
Db 3294 T TT AG GAT AT AC AAGGGT GT GAT C CAAGC TAT C CAGAAAT CAGAT GAAGG C CAC C CAT T C 3353 



Qy 2511 AG GG CAT AT T T GGAAT C T GAAGT T G C TAT AT C T GAG G AGT T GGT T CAGAAGT AC AGC AAT 257 0 

I I I M I I I I I I I I I I I I I I I II I II I I I I I II II I I I I I I I I I I I I I I I I I I II I III 
Db 3354 AG GGC AT AT C T GGAAT CT GAAGT T G CT AT AT C T GAG GAGT T GGT T CAGAAGT AC AGT AAT 3413 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGT^ACTCAGACGCCTCTTCTTAGTTGAT 2 63 0 

I I I I II I I I I I I I I I I I I I I I I I I I Mill I I I | I I I I II I I I I II | I I | I I | | | | 

Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473 

Qy 2 631 GAT T T AGTT GAT T C T C T GAAGT T T G C AGT GT T GAT GT GGGT AT T T AC CT AT GT T GGT GC C 2690 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I 
Db 3474 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3534 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3593 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

I I I M I I I I I I I I I I I I I I I II I II I I II I II I I II I I I I II I I I I II I I I I I I I I I 
Db 3594 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3653 

Qy 2 811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 287 0 

I I I I I I I I M I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3654 AAAGAT G C TAT GGC T AAAAT C CAAGCAAAAAT C C CT G GAT T GAAGC G CAAAGC T GAAT GA 3713 

Qy 2871 GAAAGC CT GAAAGAGT TAACAAT AGAGGAGTT T AT C T T T AAAGGG GAT AT T CAT T T GAT T 2930 

M III III I III II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 3714 AAAC G C C C AAAAT AAT T A GT AG GAGTT C AT CTT T AAAGG GGAT AT T CAT T T GAT T 3768 

Qy 2931 C CAT T G GGGAG GGT C AG GGAAGAACAAAG C CT T GAC AT T G C AGT G C AGT T T C AC 2 984 

I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I II I I 
Db 3769 AT AC G GG G GAGGGT C AGGGAAGAAC GAA- C CT T GAC GT T GC AGT GC AGT T T CAC AGAT C G 3827 

Qy 2985 AGAT CT T TAT T T TT AGCAAC GC AGT G- T CT GAGGAAAAAT GAC C T GT C T T GACT G 3038 

I I I I I II I II I I I I I I I I Ml II I I I II I I II I I I I I I I I I I I I II I I I 
Db 382 8 T T GT T AGAT CT T TAT T T T T AGC CAT GC ACT GT T GT GAG GAAAAAT T AC CT GT CT T GAC T G 38 87 

Qy 3039 CCCTGTGTT CAT CAT C T TAAGT AT T GT AAGCT GC TAT GT AT GGAT T TAAAT C GTAAT CAT 3098 

II I II II I II II II II I II II II I I II I II I I I I I II I I I I I I I I II I I I I I I I I I M 

Db 388 8 C CAT GT GT T CAT CAT CT TAAGT AT T GTAAGCT GC TAT GT AT G GAT T T AAAC C GTAAT CAT 3947 

Qy 3099 AT TTGTTTTTCCT GT AT GAG G C ACT GGT GAAT AAAC AAAGAT CT GAGAAAG C T GT AT AT T 3158 

I I I II II I II I II II II II I M I I I I I I I I I I I II II II 

Db 3948 A — T CTTTTTCCTATCT GAGGCACTGGT GGA AT AAAAAAC C T GT AT AT T 3994 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I I I II I I M I II II I II I I I M I I II I I I I I I III I I II I II I 
Db 3995 T T AC TT T GT T GCAGAT AGT CT T GC C G CAT C T T G G CAAGT T G C AGAGAT GGT G GAG CT 4 051 
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14; 



Qy    1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60
        || |||||||||||||||||| |||| ||||||| ||||||||||||| ||||  || ||
Db  846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905

Qy   61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120
        ||| | ||||||||||||||||||| || || |   | ||||||||||||||   |||||
Db  906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965

Qy  121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180
        ||||||||||||| || | |  |||| ||| |||||||||| ||||||||||| |||||
Db  966 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1025

Qy  181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240
        |||||||||| |||||| ||||||| ||||| ||| |||||| ||||||||| ||||||
Db 1026 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1085

Qy  241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300
        ||||||||||| |||||| | ||||  | || |||| |||||||| | |||||||||  |
Db 1086 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1145

Qy  301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360
        ||||||||||||| ||| || |||||| ||||||  ||||| ||||  ||| |||| ||
Db 1146 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1205

Qy  361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420
Db 1206 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1262

Qy  421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480
        |||||| ||||| |||||||||||||||||||||||||||||||||||||||||||||||
Db 1263 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1322
Qy 4 81 AAAGATACTT ACAAGCAAGATAGT GAT GTTT T GAT T GCT GGAGGTAATAT AGAGAGCAAA 54 0 

I I II I I I I III I I I I I I I I I I I I III I I I I I I I I I I I I II I I M I I I I 
Db 1323 AAAGATAGT AAG GAAGAT AGT GAT AT GT T GGCT GCT G GAGGT AAAAT C GAGAG CAAC 137 9 

Qy 541 T T GGAAGGTAAAGT GGAT AAGAAAC AC TT T T C AGAT AG C C T T GAACAAACAAAT C GT GAA 600 

I I I I I I I I I I I I I I I I I I I III III I I 1 II I I I I I I I I I I I I I I I I I III 
Db 138 0 T T G GAAAGT AAAGT G GAT AAAAAAT GT T T T G C AGAT AG C CT T GAG C AAACT AAT C AC GAA 143 9 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GAC ACT T CAT T T C C C AGT AC AC C AGAAGCT GT AAG A 660 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I III 
Db 144 0 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT T C T T T C C C C AGT AC G C C AGAAGGT AT AAAG 14 99 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I I I I I I 1 I I I II I I I I I II I I I I I M I I I I I I I II I I I I I I II I II 

Db 1500 GAT C GT T C AGGAG CAT AT AT CAC AT GT GCT C C CT TT AAC C CAGCAG CAAC T GAGAGC AT T 1559 

Qy 718 T CAACAAACAT TTTTCCCTTGTT GGAAG AT CAT ACT T C G GAAAATAAG AC AG AT GAAAAA 777 

I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II II 

Db 1560 GC AACAAAC AT TTTTCCTTTGT T AGGAGAT C C T ACT T C AGAAAAT AAGAC C GAT GAAAAA 1619 

Qy 77 8 AAGAT AGAA- AAAAAAAG GC ACAAAT T GTAAC AGAGAAGAAT GCAAGT GT C AAG ACAT C A 836 

II I I I I I I I I I I Ml I I I I I I II I II I I I I I I I I I I II III I I I I I I 

Db 1620 AAAAT AGAAGAAAAGAAGGC C CAAAT AGT AAC AGAGAAGAAT AC TAG CAC C AAAAC AT C A 167 9 

Qy 837 AAC C C T T T CCT T AT G GC AGC ACAGGAGT CT AAGAC AGAT T AC GT T ACAAC AG AT CAT GT G 8 96 

I II I I I I I III I I I I I I I I I I I I III I I II I I I I I I II I I I I I I I I I II I 
Db 1680 AAC CCTTTTCTT GT AGCAGC AC AG GAT T CT GAGACAGAT TAT GT CAC AAC AGAT AAT T T A 173 9 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

II I I I I I I I I I I I I I I I I I I II I I I II I I II I I 1 I I I I II II I I II I I I I II 

Db 1740 AC AAAG GT GAC T GAGGAAGT C GT G GC AAAC AT GC CT GAAG G CCT GAC T CC AGAT T T AGT A 1799 

Qy 957 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAG CT ACT GGTACAAAAAT T GC C T T T GAAAC A 1016 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1800 C AGGAAGC AT GT GAAAGT GAAT T GAAT GAAGT TACT G GT ACAAAGAT T GCT TAT GAAAC A 185 9 

Qy 1017 AAAAT GGACCT GGTT CAAACTT CAGAAGCT GT GCAGGAGT CACTTT AC CCT GTAACACAG 107 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I 
Db 1860 AAAAT G GACT T G GT T C AAAC AT C AGAAGT TAT GCAAGAGT CACT C TAT CCT GC AGCAC AG 1919 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1920 CT T T G C C CAT CAT T T GAAGAGT C AGAAGCT ACT C CT T CAC CAGT T T T G C CT GAC AT T GT T 1979 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 1980 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2039 

Qy 1197 T CAT CAC CAT T AGAAAC T CT T C CT T CAGT T AAT TAT GAAAG CAT AAAGT T T GAG CCT GAA 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I 

Db 204 0 T CAT CAC CAT T AGAAGC T T CT T CAGT T AAT TAT GAAAG CAT AAAAC AT GAG CCT GAA 2096 

Qy 1257 AAT C C C C CAC CAT AT GAG GAGGC CAT GAAT GT AT C ACT AAAAAAAGAAT C AGGAAT GAAT 1316 

II I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M 

Db 2097 AAC C C C C CAC CAT AT GAAGAG GC CAT G AGT GT AT CACTAAAAAAAGT AT C AG GAAT AAAG 2156 

Qy 1317 GAAGAAAT CAC AGAGCCT GAAGGT AT T AGT GT AGCT GT T C AG GAAAC AGAAG C T C C T TAT 137 6 



Db 2157 GAAGAAAT T AAAGAG C C T GAAAAT AT T AAT G C AGC T CT T C AAGAAAC AG AAG CT C C T TAT 2216 

Qy 1377 AT AT C TAT T GC AT GT GAT T T AAT T AAAGAAACAAAGAT C T C T ACT GAAC C GACT C C AG AT 1436 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I III II I I I I I I I I I III 
Db 2217 AT AT C TAT T G CAT GT GAT T TAAT TAAAGAAACAAAG CT T T CT GC T GAAC C AGC T C C G GAT 2276 

Qy 1437 T T C T CT AGT T AT T CAGAAAT AG CAGAAGT T GC AC AGC C AGT G C C C GAG CAT T CT GAGC T A 1496 

I I M I I I I II I I I I I I I I III MINI I I I I M II I I I I II I I I I I I II I I I I 
Db 2277 T T CT C T GAT TAT T CAGAAAT GG CAAAAGT T GAAC AG C C AGT GC C T GAT CAT T C T GAG CT A 2336 

Qy 14 97 GT T GAAGAT TCCTCCCCC GATT C T GAAC C AGT T GACT T ATT T AGT GAT GAT T C AAT AC C C 1556 

I I I I I II I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2337 GT T GAAGAT T C C T CAC C T GATT CT GAAC C AGT T GACT T ATT T AGT GAT GAT T CAAT AC CT 2396 

Qy 1557 GAAGT T CCACAAAAACAAGATGAAGCT GTAATACTTGT GAAAGAAAACCT CACT GAAATT 1616 

II I I I I I I I I I I I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 2397 GAC GTT CCACAAAAACAAGATGAAACT GT GAT GCT T GT GAAAGAAAGT CT CACTGAGACT 2456 

Qy 1617 T CAT C T GAGT CAAT GAC AG GAC AT GAC AAT AAGGGAAAACT C AGT GCT T CAC CAT CAC CT 167 6 

I I I I I I I I I I I I I I I II I I I I I I I II I I I I M II I I I I I I I I I I I I I I I 
Db 2457 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAACT C AGT GCT T T G C CACCT 2513 

Qy 1677 GAG GGAGGAAAAC C GT AT T T GGAGT CT TT T C AG C C CAGT T T AGG CAT C ACAAAAGAT ACC 1736 

I I I I II I I I I I II I I I I I I I I I I I I I I II! I I I I I I II I I I I II I I II I I I I 
Db 2514 GAGGGAGGAAAGCCATATTT GGAAT CTTTTAAGCT CAGTT TAGATAACACAAAAGAT ACC 2573 

Qy 1737 T T AGC AC CT GAT GAAGT T T CAGCAT T GAC C CAAAAGGAGAAAAT C C CT T T GC AGAT GGAG 17 96 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2574 CT GT T AC CT GAT GAAGT T T CAAC AT T GAGCAAAAAGGAGAAAAT T C CT T T G C AGAT GGAG 2633 

Qy 1797 GAG CT CAAT AC T GCAGT T TAT T CAAGT GAT G G CT TAT T CAT T G C T C AGGAAGCAAAC C T A 1856 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I III II I I I I I I I I I II 
Db 2634 GAGCT C AGTACT GCAGT T TAT T CAAAT GAT GAC T TAT T T AT TT CT AAGGAAGC AC AGAT A 2693 

Qy 18 57 AG AGAAAGT G AAACAT T T T C AG AT T CAT C T C C GAT T GAG AT TAT AG AT GAGT T C C C GAC C 1916 

I I I I I I I MINI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2 694 AGAGAAAC T GAAAC GTT T T C AGAT T CAT CT C C AATT GAAAT TAT AGAT GAGT T C C C T AC A 27 53 

Qy 1917 T TT GT C AGTT C TAAAG CAGAT T CT T CT C CT AC AT TAG C CAG GGAAT AC AC T GACCT AGAA 1976 

II I I I I I I I I M I I I I I I I I I III I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 2754 T T GAT CAGTT CTAAAACT GAT T CAT T T T CT AAAT TAG C CAG GGAAT AT AC T GAC C T AGAA 2 813 

Qy 1977 GT AGC C CAC AAAAGT GAAAT T G CT GAC AT C C AGGAT GGAGCT GGGT CAT TGGCTTGTG C A 2036 

III I I II I I I I I I M I I I II I I I I II I I I I I I II I I I I I I I I I I I I I I I II 

Db 2814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2873

Qy 2037 G GAT T G C C C CAT GAC CTTTCTTT C AAGAGT AT AC AAC C TAAAG AG GAAGT T CAT 2090 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 287 4 GAAT T GCCCCAT GAC CTTT CTTTGAAGAAC ATACAAC C CAAAGTT GAAGAGAAAAT CAGT 2 933 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150

I I I I I I I I I II I I I I I I I I I III I I I I I I I I II I I I I I I I I I I I 
Db 2934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2993 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG CAGAGAT AG GCAG CAT AGAAAAAC C CAAAGT T C T T 2210 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2994 GAT GTTTCTGCTTT GGC CAC T CAAGC AGAG AT AGAGAG C AT AGT T AAAC C CAAAGT T C T T 3053 

Qy 2211 GT G AAAGAAG C C G AGAGAAAAC TTCCTTCT GAT AC AG AAAAAG AG C GAAGAT CT C CAT C T 227 0 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I II I I I I I I I I I I | | I | I I I 
Db 3054 GT GAAAGAAG C T GAGAAAAAACT T C CT T C C GAT ACAGAAAAAGAGGAC AGAT CAC CAT C T 3113 

Qy 2271 GC TAT AT T T T CAG C AGAG CT GAGT AAAAC T T C AGT T GT T G AC C T C CT CT AC T G GAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 3114 GC TAT AT T T T CAG C AGAG CT GAGT AAAACT T C AGT T GTT GAC CT C C T GT AC T GGAGAGAC 3173 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3174 AT TAAGAAGACT G GAGT G GT GT T T GGT GC CAG C CT AT T C C T G CT G CT TT C AT T GACAGT A 3233 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 3234 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3293 

Qy 2451 T T TAG GAT AT AT AAGG GT GT GAT C CAG GCT AT C C AGAAAT C T GAT GAAGGC CAC C CAT T C 2510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || I I I I 
Db 3294 TTTAGGATAT ACAAGGGT GT GATC CAAGCTAT CCAGAAAT CAGAT GAAGGCCACCCATT C 3353 

Qy 2511 AGGGC AT AT T T GGAAT CT GAAGTT GCT AT AT C T GAG GAGT T GGT T C AGAAGT AC AG C AAT 2570 

I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M III 
Db 3354 AG GG CAT AT C T GGAAT CT GAAGTT GCT AT AT CT GAGGAGT T G GT T C AGAAGT ACAGT AAT 3413 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3474 GAT T T AGTT GAT T C T C T GAAGT T T G CAGT GTT GAT GT GGGT AT T T AC CT AT GT T GGT GC C 3533 

Qy 2691 TT GTT CAAT GGT CTGACACTACTAATTTTGGCTCTGATTTCACTCTT CAGT GTT CCT GTT 2750 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3534 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTT CAGT GTT CCTGTT 3593 

Qy 2751 AT T TAT GAAC GGC AT C AGGC GCAAAT AGAT CAT TAT C T G GGAC T T G CAAAT AAGAAT GTT 2 810 

I I I I I I I M I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3594 AT TT AT GAAC GGC AT C AGGC AC AGAT AGAT CAT TAT C TAG GACT T GCAAAT AAGAAT GT T 3653 

Qy 2811 AAAGAT GCT AT GGCTAAAAT CCAAGCAAAAAT C C CT GGATT GAAGC GTAAAGCT GAAT GA 2 87 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 3654 AAAGAT GCT AT G GCT AAAAT C CAAGCAAAAAT C C CT G GAT T GAAG C G CAAAGCT GAAT GA 3713 

Qy 2871 GAAAG CCT GAAAGAGT T AACAAT AGAG GAGT T TAT CT T T AAAG G GGAT ATT C AT TT GAT T 2930 

II III III I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3714 AAAC G C C C AAAAT AAT T A GT AG GAGT T CAT CT T T AAAG G GGAT ATT CAT TT GAT T 3768 

Qy 2 931 C CAT T G G GGAGGGT CAG GGAAGAACAAAG C CT T GAC AT T G CAGT G C AGT TT CAC 2984 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 37 69 ATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3 827 

Qy 2985 AGAT C T T TAT T T TT AGCAAC G CAGT G - T C T GAG GAAAAAT GAC CT GT CT T GAC T G 3038 

I I I I I M I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 3828 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3887



Qy 3 039 CCCTGTGTT CAT CAT C T T AAGT AT T GT AAGCT GC TAT GT AT G GAT T T AAAT CGTAAT CAT 3 098 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 38 8 8 C CAT GT GT T CAT CAT C T T AAGT AT T GTAAG C T GC TAT GT AT GGAT T T AAAC C GT AAT CAT 3 947 

Qy 3099 AT TTGTTTTTCCT GT AT GAG GC ACT GGT GAAT AAACAAAGAT C T GAGAAAG CT GT AT AT T 3158 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 3948 A — TCTTTTTCCTATCTGAGGCACTGGTGGA AT AAAAAAC C T GT AT AT T 3994 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

I I I I I I I I I II II M I I I I I I II I I I I I I I I I I III I II I I I I I 
Db 3995 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4051 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II II 
Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GT AC T G C C CAC T GAAGGAAC ACTT C CAGCAACT T CAAAT GAAGCT T CT AAAGC AT T C T C A 12 0 

II I I I I I I I II I II I I I I I I I I I I! II I I I II I I I I I I I I I I I M I I I 
Db 772 GT AT T AC C CAC T GAAGGAAC ACTT CAAGAAAAT GT C AGT GAAG C T T CTAAAGAGGT C T C A 831 

Qy 121 GAGAAG G CAAAAAAT C CAT T T GT AG AG AGAAAT T T AAC AGAAT T T T C AGAAT T GGAAT AT 180 

I I I I I I I I I I I I I II I I I I I I III I I I I I I I I I I II I I I I I 

Db 832 GAGAAGGCAAAAACT CT ACT CATAGATAGAGATTTAACAGAGT T T T CAGAAT TAGAAT AC 891 

Qy 181 T C AGAAAT GGAAT CAT CAT T CAGT G G C T CT CAAAAG GC AGAAC C T G C C GT AAC AGT AGC G 24 0 

I I I I II I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I 



Db 8 92 T C AGAAAT G GGAT CAT C GT T C AGT GT CT C T C CAAAAGC AGAAT C T G C C GT AAT AGT AGC A 951 

Qy 241 AAT C CTAGGGAC GAAAT AGT T GT GAGGAGT AGAGATAAAGAAGAGGACT TAGT TAGT CT T 300 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 952 AAT CCTAGGGAAGAAATAAT C GT GAAAAATAAAGAT GAAGAAGAGAAGTTAGTT AGT AAT 1011 

Qy 301 AAC AT C C T T CAT AC T C AG C AG G AGT TAT C T AC AGT C C T T AC GAAAT C AGTT G AAGAAGAA 360 

I I I I I I II I I I I I III II IMIII MINI I I I I I II I I I I I I I I I II 
Db 1012 AAC AT C CTT CATAAT CAACAAGAGTT AC CT ACAGCT CTTACTAAAT T GGTTAAAGAGGAT 1071 

Qy 361 GAT AGAGT TCTGTCTC CAGAAAAAAC AAAG GAC AGT T T T AAGGAAAAGG GAGT T G CAGC A 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I II 

Db 1072 GA AGT T GT GT CT T C AGAAAAAG CAAAAGAC AGT T T TAAT GAAAAG AGAGT T GCAGT G 1128 

Qy 421 GAAG CTT CT AT GGG G GAGGAAT AT GC AGACT T CAAAC CAT T T GAGC GAGT AT GG GAAGT G 480 

I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1129 GAAGC T C CT AT GAGG GAGGAAT AT G C AGAC T T CAAAC CAT T T GAGC GAGT AT G GGAAGT G 1188 

Qy 481 AAAGATAC T T ACAAGCAAGAT AGT GAT GT T TT GAT T G CT G GAG GTAAT AT AGAGAGC AAA 54 0 

I I I I I I I I III I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I I I I I 
Db 118 9 AAAGATAGT AAGGAAGATAGT GAT AT GTT GGCT GCT GGAGGTAAAAT CGAGAGCAAC 1245 

Qy 541 T T GGAAGGT AAAGT G GATAAGAAACACT T T T C AGAT AG C CT T GAAC AAACAAAT C GT GAA 60 0 

I I I I I I I I I I I I I I I I I I I III III I I I II I I I I I I I I I I I I I I I I I III 
Db 1246 T T GGAAAGT AAAGT GGAT AAAAAAT GT T T T GC AGAT AGC CTT GAG CAAACTAAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GAC ACT T CAT T T C C C AGT ACAC C AGAAGCT GTAAGA 660 

I I I I I I I I I I I II I I I I I I I II I I I I I I I II I M I I I I I I I I I I I I I II I 
Db 1306 AAAGATAGT GAGAGT AGTAAT GAT GAT AC TTCTTTCCC CAGT AC GC C AGAAGGT AT AAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACT GAGAATGTT 717 

I I II I I I I I I II I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I II 

Db 1366 GAT C GT C C AGGAG CAT AT AT C AC AT GTGCTCCCTT T AAC C C AG C AGCAACT GAGAGC AT T 1425 

Qy 718 T CAACAAACAT TTTTCCCTTGTTG GAAGAT CAT ACTT C GGAAAAT AAGACAGAT GAAAAA 77 7 

I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I II I I I 
Db 142 6 GCAACAAACAT TTTTCCTTTGT TAGGAGAT C C T ACTT C AGAAAATAAGAC C GAT GAAAAA 14 85 

Qy 778 AAGAT AGAA- AAAAAAAGG C ACAAAT T GT AAC AG AGAAGAAT G CAAGT GT C AAGACAT C A 836 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 

Db 1486 AAAAT AGAAGAAAAGAAGGCC CAAATAGTAACAGAGAAGAAT ACTAGCACCAAAACAT CA 1545 

Qy 837 AAC C CT T T C CT T AT G GC AG C ACAGGAGT CT AAGACAGAT T AC GT T ACAAC AGAT CAT GT G 896 

I I I I I I I I III I I I I I I I I I I I I III I I I I I I I I I I II I I I I I I I I I II I 
Db 154 6 AAC CCT TT T CTT GTAGCAGC ACAGGATT CT GAGACAGATT AT GT CACAACAGATAATT TA 1605 

Qy 897 T CAAAG GT GAC C GAG GAAGT AGT G G CAAAC AT G C C T GAAG GT CT AAC C C C AGAT T T GGT T 956 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II 
Db 1606 AC AAAG GT GACT GAG GAAGT C GT G G CAAAC AT G C C T GAAG GC C T GAC T C CAG AT T TAGT A 1665 

Qy 957 CAG GAAG CAT GT GAAAGT GAAT T GAAT GAAGC TACT GGT ACAAAAAT T GC CT T T GAAAC A 1016 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1666 C AGGAAG C AT GT GAAAGT GAAT T GAAT GAAGT T ACT G GT ACAAAGAT T GCT TAT GAAAC A 1725 

Qy 1017 AAAAT G GAC CT GGT T CAAACTT CAGAAGCT GT GCAGGAGT CACT TT AC CCT GT AAC AC AG 1076 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I 
Db 1726 AAAAT G GACTT GGT T CAAAC AT C AGAAGT TAT G CAAGAGT CACT C TAT C CT G CAG CAC AG 1785 



Qy 107 7 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I II I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 17 86 CTT T G C C CAT CAT T T GAAGAGT CAGAAG CT ACT C CT T C AC C AGT T T T G C C T GAC AT T GTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I III 
Db 1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905

Qy 1197 T CAT CAC CAT TAGAAAC TCTTCCTT C AGT TAAT TAT GAAAG C AT AAAGT T T GAG C CT GAA 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1906 T CAT CAC CAT T AGAAG C TT C T T C AGT TAAT TAT GAAAGCAT AAAACAT GAGC CT GAA 19 62 

Qy 1257 AAT CCC C CAC CAT AT GAGGAGGC CAT GAAT GT AT CACTAAAAAAAGAAT CAGGAAT GAAT 1316 

II I I I I I I I I II I I I I I I I I II I I I I I I I I I I i I II I I I I I I I I I I I I I I I I II 

Db 1963 AAC C C C C CAC CAT AT GAAGAGGC CAT GAGT GT AT CACTAAAAAAAGT AT CAGGAAT AAAG 2 022 

Qy 1317 GAAGAAAT C ACAGAGC CT GAAGGT AT T AGT GT AGCT GTT CAGGAAAC AGAAG CT C C T TAT 1376 

I I I I I I I I I I I I I I I I I I I I II I I II I I I I MM II I I II II I II II I II II 
Db 2 023 GAAGAAAT T AAAGAGC CT GAAAAT AT TAAT GC AG CT C T T CAAGAAAC AGAAGCT C C T TAT 2 082 

Qy 1377 AT AT CT ATT GCAT GT GAT T TAAT TAAAGAAACAAAGAT CT C T AC T GAAC C GAC T C CAGAT 1436 

I I I II I II II I II II I I II I II I I I II II I II I I I I I III II II I II I II I III 
Db 2083 AT AT CT AT T GCAT GT GAT T TAAT T AAAGAAACAAAG CT T T CT GCT GAAC C AG CT C C G GAT 2142 

Qy 14 37 T T C T CT AGTT AT T C AGAAAT AGC AGAAGT T GC AC AGC C AGT GC C C GAG CAT T CT GAGC T A 14 96 

I II II I II II II I I I I I I III I I I I I I I I I II I I I I I M II II I I I II M I II 

Db 214 3 T T C T C T GAT TAT T C AGAAAT GGCAAAAGT T GAAC AG C C AGT GC C T GAT CAT T C T GAGC T A 2202 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T CT GAAC C AGT T GAC T TAT T T AGT GAT GAT T C AAT AC C C 1556 

M I II II I II II I I II I I I M I II I II II II I I II I I II I II I I II I M II II II I I 
Db 2203 GTT GAAGATT CCT CACCT GATT CT GAACCAGTT GACT T ATTTAGT GAT GATT CAATAC CT 22 62 

Qy 1557 GAAGT T C CACAAAAACAAGAT GAAG CT GTAAT AC T T GT GAAAGAAAAC CT CAC T GAAAT T 1616 

II I I II I II II M II I I II I II I I I II II I I I II I I I M II I II I I I II I I I 

Db 22 63 GAC GT T C CACAAAAACAAGAT GAAACT GT GAT G CT T GT GAAAGAAAGT CT C ACT GAGACT 2322 

Qy 1617 T CAT CT GAGT CAAT GAC AG GAC AT GACAATAAG GGAAAACT C AGT G CT T CAC CAT CAC CT 1676 

II I I I I I I II I I I II II I II II II I I I II II II I I I I II I II I I II I II 
Db 2323 T CATTT GAGT CAAT GATAGAAT AT GAAAAT AAGGAAAAACT CAGT GCT TT GC CACCT 2379 

Qy 1677 GAG GGAGGAAAAC C GT AT T T G GAGT C T T T T CAG C C CAGT T T AGG C AT C AC AAAAGAT AC C 1736 

I I I I I I II I I I II I I II I I I I I II I I I III II I II I I I I II M II I I II I I I 
Db 238 0 GAG G GAGGAAAG C CAT AT T T GGAAT C T T T T AAG C T CAGT T T AGAT AAC ACAAAAGAT AC C 24 39 

Qy 1737 T TAG CAC C T GAT GAAGT T T C AGC AT T GAC C CAAAAG GAGAAAAT C C CT TT GCAGAT GGAG 17 96 

I I I I I I II I II II I II I I I I I II I I I I I I I II I I I I I I II II I II II I II I I 
Db 2440 C T GT T AC C T GAT GAAGT T T CAAC AT T GAG CAAAAAGGAGAAAAT T C C T TT G CAGAT GGAG 2499 

Qy 1797 GAG C T CAATAC T G CAGT T TAT T CAAGT GAT G GC T T ATT CAT T GC T CAG GAAG C AAAC C T A 1856 

I II I I I I I I I I I II I I I I I II I I I I I I I I I I I I II Ml II I I II I II I I II 
Db 2500 GAG C T CAGT ACT G CAGT T TAT T CAAAT GAT GACT T ATT TAT T T C T AAG GAAG CAC AGAT A 2559 

Qy 1857 AGAGAAAGT GAAAC AT T T T CAGAT T CAT CT C C GAT T GAGAT TAT AGAT GAGT T CCC GAC C 1916 

I II I M I II II I I I I I I I I I I I I I I II I I I I I I II I II I I I I II I I I I II I I M 
Db 2560 AGAGAAACT GAAAC GT T T T CAGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T CCC T AC A 2619 



Qy 1917 TT T GT CAGT T C T AAAGC AGAT T CT T CT C CT AC AT TAG C CAGG GAAT AC AC T GAC CT AGAA 1976 

II I I I I I I I I I I I I I I I I I I I II! I I II I I I I I I I I I I I I I I II I I I I I I I 

Db 2 620 TT GAT CAGT T C T AAAAC T GAT T CAT T T T C T AAAT TAGC C AG GGAAT AT AC T GAC C T AGAA 2 67 9 

Qy 1977 GT AG C C CACAAAAGT GAAAT T GC T GAC AT C C AGGAT G GAG CT G GGT CAT TGGCTTGT GC A 2036 

III I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I III II 

Db 2 68 0 GTATCC CACAAAAGT GAAATTGCTAATGCCCCGGATGGAGCTGGGTC ATT GCCTTGCACA 27 39 

Qy 2 037 GGAT T GC C C CAT GAC CTTTCTTT C AAG AGT AT AC AAC C T AAAGAG GAAGT T CAT 2 0 90 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I 

Db 2740 GAAT T GCC C CAT GAC CT T T CTTT GAAGAACAT ACAACC CAAAGTT GAAGAGAAAAT CAGT 27 99 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150

II I I I I I I I II I I III II II II I I I I I I I I II I I II I I I I I I I I 
Db 2800 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAG C AGAGAT AGGC AG CAT AGAAAAAC C CAAAGT T CT T 2210 

I I I I I I I I I I I I I I I I I I II I I II I I I I I II I I I I I I I I I I II I I I I I I I I I I 
Db 2 8 60 GAT GT T TCTGCTTTGGC C AC T CAAGC AGAGAT AGAGAGC AT AGT T AAAC C CAAAGT T C T T 2 919 

Qy 2211 GT GAAAGAAGC C GAGAGAAAACTT C C T T C T GAT ACAGAAAAAGAG CGAAGAT C T C CAT CT 2270 

I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINI 
Db 2920 GT GAAAGAAGC T GAGAAAAAACT T C CT T C C GAT ACAGAAAAAGAGGAC AGAT C AC CAT C T 2 979 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I i I I II I 
Db 2980 GCT AT AT T T T C AG CAGAGC T GAGT AAAACTT C AGT T GT T G AC CT C CT GT ACT G GAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I II I I I I I Mill I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I MINI 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 T T T AGGAT AT AT AAG GGT GT GAT C CAG GC T AT C CAGAAAT CT GAT GAAG GC CAC C CAT T C 2510 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I II I I I I I I I I 
Db 3160 T T TAG GAT AT ACAAG G GT GT GAT C CAAG CT AT C CAGAAAT CAG AT GAAG GC CAC C CAT T C 3219 

Qy 2511 AG GG CAT AT T T GGAAT CT GAAGT T GCT AT AT C T GAG GAGT T GGT T C AGAAGT AC AG C AAT 2570 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I III 
Db 322 0 AG GGC AT AT CT G GAAT C T GAAGT T G C TAT AT CT GAGG AGT T G GT T C AGAAGT AC AGT AAT 3279 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810

Db 3460 ATTTATGAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3519




RESULT 6 

US-09-893-348-22 

; Sequence 22, Application US/09893348 
; Patent No. US20020072493A1
; GENERAL INFORMATION: 

; APPLICANT: EISENBACH-SCHWARTZ, Michal
; APPLICANT: COHEN, Irun R. 
; APPLICANT: BESERMAN, Pierre 
; APPLICANT: MOSONEGO, Alon 
; APPLICANT: MOALEM, Gila 

; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND THEIR USES

; FILE REFERENCE: EIS-SCHWARTZ=2A 

; CURRENT APPLICATION NUMBER: US/09/893,348

; CURRENT FILING DATE: 2001-06-28 

; PRIOR APPLICATION NUMBER: US 09/314,161 

; PRIOR FILING DATE: 1999-05-19 

; PRIOR APPLICATION NUMBER: US 09/218,277 

; PRIOR FILING DATE: 1998-12-22 

; PRIOR APPLICATION NUMBER: PCT/US98/14715 

; PRIOR FILING DATE: 1998-07-21 

; PRIOR APPLICATION NUMBER: IL 124500 

; PRIOR FILING DATE: 1998-05-19 

; NUMBER OF SEQ ID NOS: 29

; SOFTWARE: PatentIn version 3.1

; SEQ ID NO 22 

; LENGTH: 3579
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (1)..(3579)
; OTHER INFORMATION:
US-09-893-348-22 

Query Match 58.0%; Score 2223.6; DB 9; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
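
The summary figures above can be cross-checked with a little arithmetic. The sketch below is not part of the search report. It assumes that "Best Local Similarity" is the number of matches divided by the total number of aligned columns (matches + conservative + mismatches + indels). That definition is an assumption, although it reproduces the reported 87.5% for this result. The variable names are illustrative only.

```python
# Figures copied from the RESULT 6 summary lines above.
matches, conservative, mismatches, indels = 2519, 0, 339, 22

# Assumption: aligned columns = matches + conservative + mismatches + indels,
# and Best Local Similarity = matches / aligned columns.
aligned_columns = matches + conservative + mismatches + indels
best_local_similarity = 100.0 * matches / aligned_columns

print(aligned_columns)                  # 2880
print(round(best_local_similarity, 1))  # 87.5
```

Under the same assumption, the other results in this listing can be checked by substituting their own Matches, Conservative, Mismatches and Indels values.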

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60

II I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M 
Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180

I I I I II I I II I I I II I I I I I I II I I I I I I II I I I I I I I I I I I I I I II I I I
Db 832 GAGAAGGCAAAAACTCTAGTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891

Qy 181 T CAGAAAT G GAAT CAT CAT T C AGT G G CT C T CAAAAGG CAGAAC CT G C C GT AACAGTAGC G 24 0 

I I I I I I I I I I I i M M I I I I I II I I I I I III I I I I I I I I I I I I I I I I I I | I I 
Db 892 T CAGAAAT G G GAT CAT C GT T C AGT GT CT C T C C AAAAG CAGAAT CT GC C GT AAT AG TAG C A 951 

Qy 241 AAT C CTAGGGACGAAAT AGTT GT GAG GAG T AGAGAT AAAGAAGAGGACTT AGT T AGT CTT 300 

I I I I I I I I M I I I I I I I I I I I I I I I I I I | | | | | | | || | | | | | | | | | | | 
Db 952 AAT C C T AGGGAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAGAGAAGT T AGT T AGTAAT 1011 

Qy 301 AACAT C CTTCATACT CAGCAGGAGTT AT CT ACAGT CCTTACGAAAT CAGTT GAAGAAGAA 360 

M I I I I I I I I II I III II I I I I I I I I I I I I I I I I I I I I I Ml I I I I || 
Db 1012 AACAT C CT T CAT AAT CAAC AAGAGT T AC C T ACAGC T C TT AC TAAAT T GGT TAAAGAGGAT 1071 

Qy 3 61 GAT AGAGT TCTGTCTC C AGAAAAAACAAAGG AC AGTT TTAAGGAAAAGG GAGT T G CAGC A 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1072 GA AGT T GT GT C T T C AG AAAAAGC AAAAGAC AGT T TTAAT GAAAAGAGAGT T GCAGT G 1128 

Qy 421 GAAGCTT CTATGGGGGAGGAATAT GCAGACT T CAAACC ATTT GAGC GAGT AT GGGAAGT G 4 80 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | || || | | | | 
Db 112 9 GAAGC T C C TAT GAGGGAG GAAT AT GCAGACT T CAAAC C ATT T GAGC GAGT AT GGGAAGT G 1188 

Qy 481 AAAGAT ACTTACAAGCAAGATAGT GAT GTTTT GATT GCT GGAGGTAAT ATAGAGAGCAAA 540 

I I M I I I I III I I II I I I I I I I I Ml | || | M II I II I II I I I I I I I I 
Db 1189 AAAGATAGT AAGGAAGATAGT GAT AT GTT GGCTGCT GGAGGTAAAAT CGAGAGCAAC 1245 

Qy 541 T T GGAAG GTAAAGT GGATAAGAAACAC T T T T CAGAT AGC CT T GAACAAACAAAT C GT GAA 600 

I I I I I I I I I I I I I I I I I I I III III || || | | M | | | | | | M I I I I II III 
Db 1246 T T GGAAAGTAAAGT GGATAAAAAAT GT T T T GC AGAT AG C CT T GAGCAAAC T AAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGCAGTAAT GAT GACACTT CAT TT CCCAGT ACACCAGAAGCT GTAAGA 660 

I I I I I I I I I I I M I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I III 
Db 1306 AAAGATAGT GAGAGT AGTAAT GAT GAT AC TTCTTTCCC CAGT ACG C C AGAAG GT ATAAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I M I I I I I I M I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I II 
Db 1366 GAT C GT C C AG GAG CAT AT AT CAC AT GT GCTCCCTT TAAC C C AG C AGCAACT GAGAGC AT T 1425 

Qy 718 T CAACAAACAT TTTTCCCTTGTT GGAAGAT CAT ACT T C G GAAAATAAGAC AGAT GAAAAA 77 7 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I || I | I M I I I I I I 
Db 142 6 GCAACAAAC AT T T T T C CT TT GT T AGGAGAT C CT AC T T C AGAAAAT AAGAC C GAT GAAAAA 14 85 

Qy 778 AAGATAGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGACAT CA 836 

M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I || I I I I I I I I I 
Db 148 6 AAAAT AGAAGAAAAGAAG GC C CAAAT AGT AAC AGAGAAGAAT ACT AGC AC C AAAACAT C A 154 5 

Qy 837 AAC CCTTTCCT TAT G G C AGC AC AGGAGT CTAAGACAGAT T AC GT T ACAACAG AT CAT GT G 896 

I I I I I I M Ml I I 1 I I II I I I I I III I I I II I I I I I II I I I I I I I I I II I 
Db 154 6 AAC CCTTTTCTT GT AGC AGC AC AG GAT T C T GAGACAGAT TAT GT C ACAACAGAT AAT T T A 1605 

Qy 8 97 T CAAAG GT GAC C GAG GAAGT AGT G G CAAACAT G C CT GAAG GT CT AAC C C CAGAT T T GGT T 956 

I'll' I I I I I I I I I I I I I I I I I I I I II I I I II I II II I I I I I I I I II 

Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 C AG GAAG CAT GT GAAAGT GAAT T GAAT GAAG CT ACT GGT ACAAAAAT T G C C T T T GAAAC A 1016 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1666 CAG GAAG CAT GT GAAAGT GAAT T GAAT GAAGT T AC T G GT ACAAAG AT T GCT TAT GAAAC A 1725 



Qy 1017 AAAAT G GAC CT G GT T C AAAC T T C AG AAG C T GT G C AGG AGT C ACT T T AC C C T GT AAC AC AG 1076 

I I I I I I I II II I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 172 6 AAAAT GGAC T T GGT T CAAACAT C AGAAGT TAT GCAAGAGT C AC T C TAT C C T GC AG C AC AG 17 85 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I II I II I I I I 
Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I II I I I I I I I III 
Db 184 6 AT GGAAGC AC CAT T GAAT T CT G C AGT T C C T AGT GCTGGTGCTTCCGT GAT AC AG C C C AGC 1905 

Qy 1197 T CAT C AC CAT T AGAAAC TCTTCCTT C AGT T AAT TAT GAAAGC ATAAAGT T T G AGC C T GAA 1256 

I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1906 T CAT C AC CAT T AGAAGC TT CT T CAGTTAATTAT GAAAGC AT AAAAC AT GAGCCT GAA 1962 

Qy 1257 AAT C C C C CAC CAT AT GAGG AGGC CAT GAAT GT AT C ACT AAAAAAAGAAT CAG GAAT GAAT 1316 

II I I I I I II II I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II 

Db 1963 AAC C C C C CAC CAT AT GAAGAGG C CAT GAGT GT AT CAC TAAAAAAAGT AT CAGGAATAAAG 2022 

Qy 1317 GAAGAAAT C AC AGAG C CT GAAGGT AT T AGT GT AG CT GT T CAGGAAAC AGAAG CT CCT T AT 137 6 

I I I I I I I I I I I I I I I I I I I I I I I I II MM I I II I I M I I I II I II II II II 
Db 2023 GAAGAAAT TAAAGAGC C T GAAAAT AT T AAT G C AGC T CT T CAAGAAAC AGAAG C T C CT T AT 2082 

Qy 1377 AT AT CT AT T G CAT GT GAT T TAAT T AAAGAAACAAAGAT CT C TACT GAAC C GACT C C AGAT 14 36 

I II I I II II I I I I I II II II I I I I I II I I I I I II II I II I M I I I I I I I II III 
Db 2083 AT AT CT AT T GC AT GT GAT T T AATTAAAGAAACAAAGCT TT CT G CT GAAC C AGCT C C GGAT 2142 

Qy 1437 T T C T CT AGT TAT T CAGAAAT AG C AGAAGT T G CAC AG C CAGT GC C C GAGCAT T CT GAGCT A 14 96 

I M I II I I I II I I II II I II I MINI II I I I II I II II II I I I I I I II II II 
Db 2143 T T CT CT GAT TAT T CAGAAAT G GCAAAAGT T GAACAGC CAGT G C CT GAT CAT T CT GAGCT A 2202 

Qy 14 97 GT T GAAGAT TCCTCCCCC GATT CT GAAC C AGT T GAC T TAT T T AGT GAT GAT T C AAT AC C C 1556 

I I I I I I I I I II II I II II I I II II I II M I II I I II II II II I I I I I I II I I II II I 

Db 22 03 GT T GAAGAT T C CT CAC CT GAT T CT GAAC CAGT T GACT TAT T T AGT GAT GAT T CAAT AC C T 2262 

Qy 1557 GAAGTT C C ACAAAAACAAGAT GAAGC T GT AAT ACT T GT GAAAGAAAAC C T CAC T GAAAT T 1616 

II I II I I I I II I II I I I I I II I I I I I I II II I II II II II I I I II I II II I I 

Db 2263 GAC GTT C C ACAAAAACAAGAT GAAACT GT GAT GC T T GT GAAAGAAAGT CT C ACT GAGAC T 2322 

Qy 1617 T CAT CT GAGT CAAT GAC AGGAC AT GAC AAT AAG GGAAAACT CAGT GC T T CAC CAT CAC CT 1676 

II I I II I I I I II II I II I II II II I I II I I II I I II II I I I II I Mill 
Db 2323 T C ATTT GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAAC T CAGT GCT T T G C CACCT 237 9 

Qy 1677 GAG G GAG GAAAAC C GT AT T T GGAGT CT T T T CAG C C CAGT T TAG G CAT C ACAAAAGAT AC C 17 36 

I I II II I I II I II II II I I II II I II I III I I II II I I I I II I I I I II II II 

Db 238 0 GAG G GAG GAAAG C CAT AT T T GGAAT CT T T T AAG CT CAGT T T AGAT AAC AC AAAAG AT AC C 2 4 39 

Qy 1737 T TAG CAC CT GAT GAAGT T T C AGC AT T GAC C C AAAAG GAGAAAAT C C C T T T GC AG AT GGAG 1796 

I II II I I II I II I I I II I I II I I I I II II II I I I II I II I I I I I I I II II I I 
Db 24 4 0 CT GT T AC CT GAT GAAGT T T CAAC AT T GAG C AAAAAG GAGAAAAT T C C T T T G C AGAT G GAG 24 9 9 

Qy 1797 GAGCT CAAT AC T G CAGT T TAT T CAAGT GAT G GCT TAT T CATT GC T CAGGAAG C AAAC C T A 1856 

II II II I I I II I I I I I II II I I I I I I I II I I I II I III II I II I I I II I II 

Db 2500 GAGCT CAGT ACT G CAGT T TAT T CAAAT GAT GAC T TAT T TAT T T CT AAG GAAG CAC AGAT A 2559 



Qy 1857 AGAGAAAGT GAAACATTT T CAGATT CAT CT C C GAT T GAG AT TAT AG AT GAGTT CCC GAC C 1916 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 
Db 2560 AGAGAAACT GAAAC GT T T T C AGAT T CAT CT C C AAT T GAAAT T AT AGAT GAGT T CCC T AC A 2619 

Qy 1917 TT T GT C AGT T CT AAAG CAGAT T C T T C T C CT ACAT TAG C C AG G GAAT AC ACT GAC C T AGAA 1976 

II I I I I II I I I I I I I I I I I I I Ml I I I I || I I I | | | | | | | | | | || | | || | | 
Db 2620 T T GAT C AGT T C T AAAAC T GAT T CAT T T T C TAAAT T AGC CAG GGAAT AT ACT GAC C T AGAA 2 679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

Ml I I I I I I I I I II I II I II I I I I II I I I I I I I I I I II I I I I I I I I I I I II 
Db 2680 GT AT C C C AC AAAAGT GAAAT T G C TAAT GC C C C G GAT GGAGC T GGGT CAT T GC CT T GC AC A 2739 

Qy 2037 G GAT T GC C C CAT GAC CTTTCTTT C AAGAGT AT AC AAC C TAAAGAGGAAGTTCAT 2 09 0 

I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I i II | 

Db 274 0 GAAT T G C C C CAT GAC CTTTCTTT GAAGAACAT ACAAC C C AAAGT T GAAGAGAAAAT C AGT 27 99 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I I I I I II II III II M II I I I I I I I I I I I I II I I I M II I 
Db 2 8 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2 859 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAGC AGAGAT AG G CAG C AT AGAAAAAC C CAAAGT T CTT 2210 

I I I I I M I I I I I I I I I I I I I I I I I I II I I II I II I I I I M I I I I I I I I I I I I I 
Db 2860 GAT GT T TCTGCTTTGGC C ACT CAAGC AGAGAT AGAGAG CAT AGT T AAAC C CAAAGT T C T T 2919 

Qy 2211 GT GAAAGAAGC CGAGAGAAAACT T C C TT C T GAT AC AGAAAAAGAG C GAAGAT CT CC AT CT 227 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I II I 

Db 2920 GT GAAAGAAGCTGAGAAAAAACTT C CTT CCGATAC AGAAAAAGAGGACAGAT CACCAT CT 2979 

Qy 2271 GCT AT AT T T T C AGCAGAG C T GAGTAAAAC TT C AGT T GT T GAC C T C CT CT ACT G GAGAGAC 2330 

II I I I II II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 29 8 0 GCTATATTTT CAGCAGAGCT GAGTAAAACTT CAGTT GT TGAC CT C CT GTACT GGAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I M I I I I II I I I II I II I I II I I I II I M I I I I I I I I I I I I I I I I I MINIM 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 30 99 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I II II I II I I I Mill I I I II I I I I I I I I I I I I I II I I II I I II M I I I M I I I 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 T T T AGGAT AT ATAAG G GT GT GAT C C AGG CT AT C C AGAAAT C T GAT GAAGGC C AC C CAT T C 2510 

I I I I I I I I I I I I II II I I II I I I II I II I II I I I I I I I I I I II II I I || || I II II I 
Db 3160 T T T AGGAT AT ACAAGGGT GT GAT C CAAGC TAT C CAGAAAT CAGAT GAAG G C C AC C CAT T C 3219 

Qy 2511 AG G G CAT AT T T G GAAT C T G AAGT T G C TAT AT C T GAG G AGT T G GT T C AGAAGT AC AG C AAT 2570 

I I I I I I I I I I I II I I II I I I II I I I I I I I I II I II I II I I II II I II I II I I I I I Ml 
Db 3220 AGG GC AT AT C T G GAAT CT GAAGTT G CTAT AT CT GAGGAGT T GGT T C AGAAGT ACAGT AAT 3279 

Qy 2571 TCTGCTCTTGGT CAT GT TAAC T GCACAAT AAAAGAACT C AGAC GC CT C T T C T T AGT T GAT 2630 

I I I I I I I II I I I II I I I I I I I I I II I II I I I I I I II I I I I I I II I II I II I II I I I 
Db 32 8 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTT GAT TCTCT GAAGTT TGCAGTGTT GAT GTGGGTATTTACCTATGTT GGT GCC 2690 

I M I I I I M I I I II I II I I M I I I I I I I II II I I I I I II I I I II I I I II I II II I I I I I I 
Db 3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 



1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 

Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459 

Qy 2 751 AT T TAT GAAC GGC AT C AGG C G CAAAT AGAT CAT TAT C T GGGAC T T G C AAAT AAGAAT GTT 2 810 

I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I 
Db 34 60 AT T TAT GAAC GG CAT C AGG C G CAGAT AGAT CAT TAT C T AGGACT T GCAAAT AAGAAT GTT 3519 

Qy 2 811 AAAGAT GC T AT G G CT AAAAT C C AAG CAAAAAT C C CT GGATT GAAG C GT AAAGCT GAAT GA 2870 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3520 AAAGAT GCT AT GGC T AAAAT C CAAGCAAAAAT C C CT GGATT GAAG C G CAAAGC T GAAT GA 3579 



RESULT 7 

US-10-267-502-212 

Sequence 212, Application US/10267502 
Publication No. US20040071700A1 
GENERAL INFORMATION: 
APPLICANT: Kim, Jaeseob 
APPLICANT: Galant, Ron 

TITLE OF INVENTION: Obesity Linked Genes 
FILE REFERENCE: LSD-07416 

CURRENT APPLICATION NUMBER: US/10/267,502 
CURRENT FILING DATE: 2003-01-27 
NUMBER OF SEQ ID NOS: 439 
SOFTWARE: PatentIn version 3.2 
SEQ ID NO 212 
LENGTH: 3579 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-267-502-212 

Query Match 58.0%; Score 2223.6; DB 12; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
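
The summary counts above can be cross-checked against the reported Best Local Similarity. The following is a minimal sketch, not the search program's own code: it assumes the percentage is Matches divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels), and the helper names `parse_counts` and `local_similarity` are hypothetical.

```python
# Sketch: recompute "Best Local Similarity" from the summary counts.
# Assumption (not stated in the report): similarity = matches / aligned columns,
# where aligned columns = Matches + Conservative + Mismatches + Indels.

def parse_counts(line):
    """Split a 'Name value; Name value;' summary line into a dict of ints."""
    counts = {}
    for field in line.split(";"):
        field = field.strip()
        if field:
            name, value = field.rsplit(None, 1)  # last token is the number
            counts[name] = int(value)
    return counts


def local_similarity(counts):
    """Percentage of aligned columns that are exact matches."""
    aligned = (counts["Matches"] + counts["Conservative"]
               + counts["Mismatches"] + counts["Indels"])
    return 100.0 * counts["Matches"] / aligned


summary = "Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7;"
counts = parse_counts(summary)
print(round(local_similarity(counts), 1))  # 2519 / 2880 columns
```

Under that assumption the counts give 2519 / 2880, which rounds to the 87.5% shown in the report.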

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I II I I I I I I I I I II I I II I I I I I I I I I I II I I I I I I I I I I II I II II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GT AC T G C C C ACT GAAG GAACAC TT C C AG CAACT T CAAAT GAAGC T T CT AAAG CAT T CT C A 120 

III I I I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I I I 

Db 772 GT AT T AC C C ACT GAAG GAACAC T T CAAGAAAAT GT CAGT GAAGCT T CT AAAGAGGT CT C A 831 

Qy 121 GAGAAG G C AAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T T T C AG AAT T G GAAT AT 180 

I I I I I I I I I I I I I II I I I I I I III I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 832 G AGAAGG C AAAAACT CT AC T CAT AGAT AGAGAT T T AAC AGAGT T T T C AG AAT TAG AAT AC 891 

Qy 181 T C AGAAAT GGAAT CAT CAT T CAGT G GCT CT CAAAAG G C AGAAC C T G C C GT AAC AGT AG C G 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I II MINI 
Db 8 92 T C AGAAAT GG GAT CAT C GT T CAGT GT CT CT C CAAAAG C AGAAT C T G C C GT AAT AGT AG C A 951 

Qy 241 AAT C CT AG GGAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAG AG GAC T T AGT T AGT C T T 3 00 

I I I I II I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 952 AAT C CT AG GGAAGAAAT AAT C GT GAAAAAT AAAGAT GAAGAAG AGAAGT T AGT T AGT AAT 1011 



Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I II I I I I I I I I I I I I I I I III I I I I M 



Db 1012 AACAT C CT T C AT AAT CAACAAGAGT T AC C T ACAGC T CT T ACT AAAT T G GT T AAAGAGG AT 1071 

Qy 361 GAT AGAGT T CT GT C T C C AGAAAAAAC AAAGGAC AGT T T T AAG GAAAAG G GAGT T G CAG C A 420 

II Mil I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1072 GA AGT T GT GT C T T C AGAAAAAG C AAAAGAC AGT T T T AAT GAAAAGAGAGT T GCAGT G 1128 

Qy 421 GAAGCT T CT AT GG GG GAG GAAT AT G C AGACT T CAAAC CAT T T GAG C GAGT AT G GGAAGT G 4 80 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1129 GAAG C T C C TAT GAG G GAG GAAT AT G C AGAC T T CAAAC CAT T T GAG C GAGT AT G G GAAGT G 118 8 

Qy 4 81 AAAGATACTTACAAGCAAGATAGT GAT GTTT T GAT T GCT GGAGGTAAT ATAGAGAGCAAA 540 

I I I I I I I I III I I I II I I I I I I I III I I I I I I I I I I I I II I I I I I I I I 
Db 118 9 AAAGATAGT AAG GAAGAT AGT GAT AT GT TGGCTGCTG GAG GTAAAAT C GAGAGC AAC 12 45 

Qy 541 T T GGAAG GTAAAGT G GAT AAGAAAC ACT T TT CAGAT AG C CT T GAAC AAACAAAT C GT GAA 600 

I I I I I I I I I I I I I I I I I I I III III I I I I I I I I I I I I I I I II I I II I III 
Db 1246 T T GGAAAGT AAAGT GGAT AAAAAAT GT T T T G CAGAT AGC C T T GAGCAAACTAAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGCAGT AAT GAT GAC ACT T CAT T T C C C AGTAC AC C AGAAGCT GT AAGA 660 

I I I I I I I I I I I II I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III 
Db 1306 AAAGATAGT GAGAGTAGTAAT GATGAT ACTT CTTT C CC CAGTAC GC CAGAAGGTATAAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAAC T GAGAAT GT T 717 

I I II I I I I I I II I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I II 
Db 1366 GAT C GT C CAG GAGC AT AT AT C AC AT GT GCTCCCTT T AACC C AGC AG C AACT GAGAGCAT T 1425 

Qy 718 T CAACAAACAT TTTT CCCTT GTTGGAAGATCAT ACTT C GGAAAATAAGACAGAT GAAAAA 777 

I I I I I I I I I I I I I I I I Mill I II II I II I II I II II I II II I I I II I II I M 
Db 142 6 G CAACAAACAT TTTTCCTTTGT TAG GAGAT C CT AC T T CAGAAAAT AAGAC C GAT GAAAAA 14 85 

Qy 77 8 AAGATAGAA- AAAAAAAG G C ACAAAT T GTAACAGAGAAGAAT GC AAGT GT CAAGACAT C A 8 36 

I I II I I II MM I I I I I I II II I II I I II I II I I I M I II III I II I I I 
Db 14 8 6 AAAATAGAAGAAAAGAAG GC C CAAAT AGT AAC AGAGAAGAAT AC TAG C AC CAAAACAT C A 1545 

Qy 837 AAC C CT TT C CT T AT GGCAG C AC AG GAGT CTAAGAC AGAT T AC GT TACAAC AGAT C AT GT G 8 96 

I II II II I III I I II I I I I I II I II I I II I I I II II II I I I I M I I I II I 
Db 154 6 AAC CCTTTTCTT GT AGCAGC AC AGGAT T CT GAG AC AGAT TAT GT CACAAC AGAT AATT T A 1605 

Qy 8 97 T CAAAGGT GAC C GAGGAAGT AGT GG CAAACAT G C CT GAAGGT CTAAC CC CAGAT T T GGT T 956 

I II I I I II I I II I I I I II I II I II I II II II I II I I II II II I I I I I II I II 
Db 1606 ACAAAGGT GACT GAGGAAGT C GTGG CAAACAT GC CT GAAGGCCT GACTC CAGAT TTAGT A 1665 

Qy 957 CAG GAAG CAT GT GAAAGT GAAT T GAAT GAAG C TACT GGT ACAAAAAT T GC CT T T GAAAC A 1016 

I I I II I I I I I I I II I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I 
Db 1666 CAG GAAG CAT GT GAAAGT GAAT T GAAT GAAGTT AC T G GT ACAAAGAT T G CT T AT GAAAC A 1725 

Qy 1017 AAAAT G GAC C T G GTT CAAAC T T C AGAAG CT GT G CAGGAGT CACT T T AC C CT GT AAC AC AG 1076 

I I I I I I I II II II II II II II II I II I I II I I I I II II I II I I I I I I I I I I 
Db 172 6 AAAAT G GACT T G GT T CAAACAT C AGAAGT T AT G CAAGAGT CACT CT AT C C T GC AGCAC AG 17 85 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I II I I II I I II I I I I I I II II I I I I I II II II I I I I I I I I I I II I I II I I I I 
Db 17 8 6 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I II I II I I I I I I I I I I II I I I I II I II I I I I I II I I I II I I I I I I I I IN 
Db 1846 AT G GAAGCAC CAT T GAAT T CT G CAGT T C C T AGT G C T G GT GCT T C C GT GAT AC AGC C CAG C 1905 



Qy 1197 T CAT C AC CAT T AGAAAC TCTTCCTT C AGT T AAT TAT GAAAGC AT AAAGT T T GAG C CT GAA 1256 

I II I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I II I I I I 

Db 1906 T CAT C AC CAT T AGAAG C T T CT T C AGT TAAT TAT GAAAG C ATAAAAC AT GAGC C T GAA 1962 

Qy 1257 AAT C C C CCACCAT AT GAGGAGGC CAT GAAT GT ATCACTAAAAAAAGAAT CAGGAAT GAAT 1316 

II I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1963 AAC C C C C C AC CAT AT GAAGAG G C CAT G AGT GT AT CACT AAAAAAAGT AT C AG GAATAAAG 2022 

Qy 1317 GAAGAAAT CACAGAGCCT GAAGGTATT AGT GTAGCTGTT CAGGAAACAGAAGCT C CTTAT 137 6 

I I I I I II I I I II I I I I I I I I I I I I II I I I I II I I I I I I I I I I I II I I I I I II 
Db 2023 GAAGAAAT T AAAGAGC CT GAAAAT AT TAAT GCAGCT C T T C AAGAAAC AGAAGC T C CT T AT 2082 

Qy 1377 AT AT CT AT T GC AT GT GAT T TAAT T AAAGAAAC AAAGAT CT C TAC T GAAC C GAC T C CAGAT 1436 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I III I I I I I II I I I I III 
Db 2083 AT AT CT AT T G CAT GT GAT T TAAT T AAAGAAACAAAGC T T T CT GC T GAAC C AGC T C C GGAT 2142 

Qy 1437 T T C T CT AGT TAT T C AGAAAT AG CAGAAGT T GCACAG C C AGT GC C C GAG CAT T C T G AGCT A 1496 

I I I I I I II I I I I I I I I I I III I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I 

Db 2143 T T C T C T GAT TAT T C AGAAAT G G CAAAAGT T GAACAGC CAGT GC CT GAT CAT T C T GAGCT A 2202 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GAC T T ATT T AGT GAT GAT T CAAT AC C C 1556 

II I I I I I I I I I I I I II II I I I I I I I I I I I II I I I I I I I I I I II I I I I II I I I I I I I I 

Db 2203 GT T GAAGAT T C CT C AC CT GAT T CT GAAC CAGT T GAC T TAT T T AGT GAT GAT T CAAT AC C T 2262 

Qy 1557 GAAGT T C C AC AAAAAC AAGAT GAAG C T GT AAT ACT T GT GAAAGAAAAC CT CACT GAAAT T 1616 

II I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I II I I I II I I I 
Db 2263 GAC GT T C C AC AAAAAC AAGAT G AAAC T GT GAT G C T T G T G AAAGAAAGT C T C AC T GAG AC T 2322 

Qy 1617 T CAT C T GAGT CAAT GAC AGGAC AT GAC AATAAG G GAAAACT CAGT G C T T C AC CAT C AC C T 1676 

I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 2323 T CAT T T GAGT CAAT GAT AGAAT AT GAAAAT AAGGAAAAACT CAGT GCT T T GC CACCT 2379 

Qy 1677 GAGG GAG GAAAAC C GT AT T T G GAGT CT T T T C AGC C CAGT TT AGGC AT C AC AAAAGAT AC C 1736 

I I I I II I I I I I II I I I I I I I I MINI III I I I II I I I I I I I II I I I I I I I I 
Db 2380 GAGGGAG GAAAG C CAT AT T T G GAAT CT T T TAAG CT CAGT TTAGAT AAC AC AAAAGAT AC C 2439 

Qy 1737 T T AGCAC C T GAT GAAGTT T C AGC AT T GAC C CAAAAG GAGAAAAT C CCT T T GC AGAT GGAG 17 96 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2440 CT GTTACCT GAT GAAGTT TCAAC AT T GAGCAAAAAGGAGAAAATT CCTTT GCAGAT GGAG 2499 

Qy 1797 GAG CT CAAT ACT GC AGT T TAT T C AAGT GAT G G CT T AT T CAT T G CT C AGGAAG CAAAC C T A 1856 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II MINIM I II 
Db 2500 GAGCT CAGT ACT GCAGTTTATT CAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2559 

Qy 18 57 AGAGAAAGT GAAACATTTT CAGAT T CAT CT C C GAT T GAGAT TAT AGAT GAGT T CC C GAC C 1916 

I I I I I II MINI I I I I II I I I I I II I I I I I I I I I I II I I I I I I II II I I II II 

Db 2560 AGAGAAACT GAAAC GT TT T CAGAT T CAT CT C CAAT T GAAAT TAT AGAT GAGT T C C C TAC A 2619 

Qy 1917 T T T GT CAGT T C T AAAGCAGAT TCTTCTCC T ACAT TAGC CAGG GAAT AC AC T G AC CT AGAA 1976 

II I II I I I I I II I I I I I I I I I III I I I I I I I I I I II II I I I I I II II I I II 

Db 2 620 T T GAT CAGT T C T AAAACT GAT T CAT T T T C T AAAT T AGC CAGGGAAT AT AC T GAC C T AGAA 2 679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I II I I II I I I I I I I I I I II I I I I I I I I I I I I M M I I I I I I I II II M 

Db 2680 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2739 



Qy 2037 GG AT T G C C C CAT GAC CT T T C T T T C AAGAGT AT ACAAC C TAAAGAGGAAGTTCAT 2090 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II- I I I I 

Db 2740 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I I I i II II III II II II I I I I I I I I I I I I II I I I I I I I I 

Db 2800 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I I I II I I I I I I I I 

Db 2920 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2979 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2980 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I i I I II I I I I I I II 

Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I M II I I I I I II I I I I I I I I I I I I 1 I I I I I I II I I I I I I I I II I I II I I I 

Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3160 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I III 

Db 3220 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3279 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I M I 

Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I 

Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I 

Db 3460 ATTTATGAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3519 

Qy 2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2870 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I 

Db 3520 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 3579 
Query Match 58.0%; Score 2223.6; DB 17; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I II I I I I I I I I II I I I I II I II II 
Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

III I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I II I I I I I I I I I 
Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

I I I I I I I I I I I I I II I I I I I I III I II II I I I I I I I I I I I I I I II I I I I I 
Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I II I I I I I I I I I II I I 
Db 892 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 

I I I I I I I I I I I MINI I I I I I I II I I II I I I I I I I I I I I I II I I I I I 
Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I III MM II 
Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

M I I I I I I II I I I II I I II MM II II I I I I II I I I I I I I II I I I I I I I 
Db 1072 GA AGT T GT GT CT T C AGAAAAAG CAAAAGACAGT T T T AAT GAAAAG AGAGT T GCAGT G 1128 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480 

II I II I I I II I II I I I II II II M I I I I I I I I I I I I I I I I II I I I I I M I II II I I I I 
Db 1129 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1188 



Qy 4 81 AAAGAT ACT T ACAAG CAAG AT AGT GAT GT T T T GAT T GCT GGAGGTAAT AT AGAGAGCAAA 540 

I I I I I I I I III I I I I I I I I I I I I Ml I I I I I I I II I I I II I I I I I I I I 
Db 1189 AAAGATAGT AAGGAAGAT AGT GAT AT GTTGGCTGCT GGAG GTAAAAT C GAGAGC AAC 1245 

Qy 541 T T GGAAGGT AAAGT G GATAAGAAAC ACT T T T CAGAT AGC C T T GAACAAAC AAAT C GT GAA 600 

I I I I I I I I I II I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 124 6 T T GGAAAGTAAAGT GGAT AAAAAAT GT T T T GCAGAT AG C CTT GAG C AAACT AAT C AC GAA 13 05 

Qy 601 AAAGATAGT GAAAG C AGTAAT GAT GAC AC T T CAT T T C C C AGT AC AC C AGAAG C T GT AAGA 660 

I I I I II I II I I II I I I II I I I I I I I I I II II I I I I II I I I I I I I I I I III 
Db 1306 AAAGATAGT GAGAGT AGT AAT GAT GAT ACT TCTTTCCC CAGT AC G C C AGAAGGT ATAAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AACAACT GAGAAT GT T 717 

I I II I I I I I I II I I I I I I II I I I I II I I I I I I I II I I I I II I I I II 

Db 1366 GAT C GT C CAGGAGC AT AT AT C AC AT GT GCTCCCTT TAAC C CAG C AGCAAC T GAGAG CAT T 1425 

Qy 718 T C AAC AAAC AT TTTTCCCTT GT T GGAAGAT CAT AC T T C G G AAAAT AAGAC AGAT G AAAAA 777 

II I I I II I II I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I 
Db 1426 GCAACAAACATT TTT CCTTT GT TAGGAGAT CCT ACT T CAGAAAAT AAGAC C GAT GAAAAA 14 8 5 

Qy 778 AAGATAGAA- AAAAAAAGGCACAAATT GTAACAGAGAAGAAT GCAAGT GT CAAGACAT CA 8 36 

II I I I I I I I I I I I I I I I I I II I II I I I I I I I M I I I I I II III I I I I I I 

Db 14 8 6 AAAAT AGAAGAAAAGAAGGC C C AAAT AGT AAC AGAGAAGAAT AC TAG C AC C AAAAC AT C A 1545 

Qy 837 AAC C CTTT CCTTAT GGCAGCACAGGAGT CTAAGACAGATT ACGTTACAACAGAT CAT GT G 896 

I I I I I I I I III I I I I I I II I I II III I II I I I I I I I I I I I I I I I I I I II I 
Db 1546 AAC CCTTTTCTT GT AG CAG C AC AG GAT T C T GAGAC AG AT TAT GT C AC AAC AG AT AAT T T A 1605 

Qy 897 T CAAAGGT G ACC GAG GAAGT AGT G GCAAAC AT G C C T GAAG GT CT AAC C C CAGAT T T GGT T 956 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M II II I I I I II I I II 
Db 1606 ACAAAGGT GACT GAGGAAGT C GT G GCAAAC AT GC CT GAAGGC CT GACTC CAGATT TAGT A 1665 

Qy 957 CAGGAAGCAT GT GAAAGT GAATTGAAT GAAGCTACTGGTACAAAAAT T GC CTTT GAAACA 1016 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I II I I I I I 
Db 1666 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGT T AC T GGT ACAAAGAT T G CT T AT GAAACA 1725 

Qy 1017 AAAAT GGAC CT G GT T CAAACT T C AGAAG CT GT GCAG GAGT CACT T T AC CCT GT AACAC AG 107 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I ■MIM- II I I II I I I I I I 

Db 1726 AAAAT GGACT T G GT T C AAAC AT C AGAAGT TAT GCAAGAGT CACT CT AT CCT GCAG CAC AG 1785 

Qy 1077 CTTT GCCCATCTTTTGAAGAATCTGAAGCTACTCCGT CAC CGGT TTT GCCTGACATTGTC 1136 

II II I II II I I I II II I I I II I I M II II I I I I I I I I I I II I II I I I I I I II I I 

Db 17 8 6 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 18 45 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

II I I I II II I II M I I II II I I II I I I M I II II M II II I I I I I II I III 
Db 1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 T CAT CAC CAT T AGAAAC TCTTCCTT C AGT T AAT TAT GAAAG CAT AAAGT T T GAG CCT GAA 1256 

II I II I II I I I I II I I I II I M I II M I I II I I I I I I II I II II II II I I I I 
Db 1906 T CAT CAC CAT T AGAAG C T T CT T C AGT T AATT AT GAAAG CAT AAAAC AT GAG CCT GAA 1962 

Qy 1257 AAT C C C C CAC CAT AT GAGG AG G C CAT GAAT GT AT CAC T AAAAAAAGAAT CAG GAAT GAAT 1316 

M II II II I II II I II I || I I I II II M I II II I II II II I II II II I I I II II 
Db 1963 AAC C C CCCAC CAT AT GAAGAGGCC AT GAGT GT AT CACT AAAAAAAGT AT CAGGAAT AAAG 2022 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCT CCTTAT 137 6 



I I I I I I I I I I I I I I I I I II I I I I I II INI I I I I I I I I I II II I I I I I II I I 

Db 2023 GAAGAAAT TAAAGAG C CT GAAAAT AT T AAT G C AG C T CT T CAAGAAAC AGAAGCT C C T TAT 2082 

Qy 1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I III 
Db 2083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 1496 

I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2143 T T C T CT GAT TAT T CAGAAAT GG CAAAAGT T GAACAGC CAGT GCCT GAT CAT T CT GAGCT A 2202 

Qy 14 97 GT T GAAGAT TCCTCCCCC GAT T CT GAAC CAGT T GACT T AT T T AGT GAT GAT T C AAT ACC C 1556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 22 03 GTT GAAGATT C CT CACCTGAT TCT GAAC CAGT TGACT TATTTAGT GAT GATT CAATACCT 22 62 

Qy 1557 GAAGT T C C ACAAAAACAAGAT GAAG C T GTAAT AC T T GT GAAAGAAAAC C T C ACT GAAAT T 1616 

II I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 2263 GAC GT T C C ACAAAAACAAGAT GAAAC T GT GAT GC T T GT GAAAGAAAGT C T CACT GAGAC T 2322 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCTT CACCAT CACCT 1676 

I I II I I I I I I II I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I 
Db 2323 T CATTT GAGT CAAT GATAGAATAT GAAAAT AAGGAAAAACT CAGT GCTTT GC CACCT 2379 

Qy 1677 GAGG G AGGAAAAC C GT ATT T G GAGT CT T T T C AGC C C AGT T T AGGC AT C ACAAAAGAT AC C 1736 

I I I I I I I I I I I II I I 11 I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 
Db 2380 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2439 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 2440 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2499 

Qy 1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III II I I I I I I I I I II 
Db 2500 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2559 

Qy 1857 AGAGAAAGT GAAACAT T TT CAGAT T C AT CT C C GAT T GAGAT TAT AGAT GAGT T C C C GAC C 1916 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2560 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 2620 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2036 

III I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2680 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2739 

Qy 2037 G GAT T GC C C CAT GAC CTTTCTTT CAAGAGT AT ACAAC C TAAAGAG GAAGT T CAT 2090 

I I I I I I I M I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I 

Db 2740 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2091 GT CC CAGAT GAGT TCTCCAAAGATAGGGGT GAT GTTTCAAAGGTGCCCGTACT GCCT CCA 2150 

II I I I I I I I II II III II II II I II I I II I I I I I II I I I I I I I I 

Db 2800 TTCT CAGAT GACTTTTCTAAAAATGGGTCTGCTACATCAAAGGT GCT CTTATT GCCT CCA 2859 

Qy 2151 GAT GTTTCTGCTTTG GAT GCT CAAG C AGAGAT AGG C AG C AT AGAAAAAC C C AAAGT T C T T 2210 

I I I I II I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I M 



Db 2860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

I I i I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2920 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2979 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 
Db 2980 G C TAT AT T T T CAGCAGAG CT GAGT AAAAC T T C AGT T GT T GAC C T C C T GT AC T G GAGAGAC 3039 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I I I I I I I I Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I II 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 T T TAG GAT AT AT AAGG GT GT GAT C C AG GC T AT C C AGAAAT C T GAT GAAG G C C AC CC AT T C 2510 

I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3160 TTTAGGATAT ACAAGGGT GT GAT CCAAGCTAT C CAGAAAT CAGAT GAAGGC CACCCATT C 3219 

Qy 2511 AGG GC AT AT T T G GAAT C T GAAGT T GC T AT AT CT GAG GAGT T GGT T C AGAAGT AC AGC AAT 2570 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 3220 AGGGCAT AT CT G GAAT C T GAAGT T GC T AT AT C T GAG GAGT T GGT T CAGAAGT AC AGT AAT 3279 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GAT T TAGT T GAT T CTCT GAAGT TTGCAGTGTT GAT GTGGGTATTTACCTATGTT GGT GCC 2690 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I 

Db 3340 GAT T TAGT T GAT T CT C T GAAGT T T GC AGT GT T GAT GT G GGT AT T TAC CT AT GT T GGT GC C 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

I I I I I I I II I I I I I I I II I I I I I I I I I I 11 I I I I I I I I I I II I I I I I I II I I I I I I I I 
Db 3460 AT T TAT GAAC G GC AT C AG G C GC AGAT AGAT CAT TAT C T AGG ACT T GCAAAT AAGAAT GT T 3519 

Qy 2811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2870 

I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 1 I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3520 AAAGAT GC T AT GG CTAAAAT CCAAG CAAAAAT C C CT GGAT T GAAGC G CAAAG C T GAAT GA 3579 
Query Match 58.0%; Score 2223.6; DB 17; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I II I I I I I I MM II II I I I M II II I M II II MM II M 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

Ml I I II I I I II II I II I II I I I II II I I I II M II I I I I I II I I I M 
Db 772 GT AT T AC C C AC T GAAGGAAC AC T T CAAGAAAAT GT C AGT GAAGC T T CTAAAGAG GT CT C A 831 

Qy 121 GAGAAGG CAAAAAAT C CAT T T GT AGAGAGAAAT T T AAC AGAAT T TT CAGAATT GGAAT AT 180 

I I I I M I I I I I II II I I I II I III I I II I I II I I I I I I I I I I I I I Mill 
Db 832 GAGAAGGCAAAAACT C TACT C AT AGAT AGAGATT T AAC AGAGT T T T CAGAAT T AGAAT AC 891 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240 

I I II I I II II I I II II I I I II I I Mill III I II I II I M II I I I I I I I I I I 
Db 892 T CAGAAAT GGGAT CAT C GTT CAGT GTCT CTC CAAAAGC AGAAT CT GCCGTAAT AGTAGC A 951 

Qy 241 AAT C C T AG GGAC GAAAT AGT T GT GAG GAGTAGAGATAAAGAAGAGGACT T AGT T AGT C T T 300 

I I I II II I II I II II I I I II I I I II II I I II II I M I I I I I M I II I I 

Db 952 AAT CCTAGGGAAGAAATAAT CGT GAAAAATAAAGAT GAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 301 AACAT C C T T CAT AC T CAG C AGGAGT T AT CT ACAGT C C T T AC GAAAT CAGT T GAAGAAGAA 360 

M M I I II I I II I Ml M II I I I I II I I II Mill I II I III MM II 

Db 1012 AACAT C C T T CAT AAT CAACAAGAGT T AC CT ACAG CT CT T ACT AAAT T GGT T AAAGAG GAT 1071 

Qy 361 GAT AGAGT TCTGTCTC C AGAAAAAACAAAGGAC AGT T T T AAGGAAAAGG GAGT T GC AGC A 420 

II MM II II I I M I I II I I I I I I II II II II II II II I I I II I I I I I I 

Db 1072 GA AGT T GT GT C T T C AGAAAAAG C AAAAGAC AGT T T T AAT GAAAAGAGAGT T G CAGT G 1128 

Qy 421 GAAGCT T CT AT GG G GGAG GAAT AT GC AGACT T CAAAC CAT T T GAG C GAGT AT G GGAAGT G 480 

I I I I M Mill M I I II I I I I I I I II II I I I I I M II M II I II I I I I II I I I I I II I 
Db 1129 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1188 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540 

I I II I I I I III I I I I I II I I I I I III I I II II II I I M M I I I I I I I I 
Db 1189 AAAGATAGT AAG GAAGAT AGT GAT AT GTTGGCTGCT GGAG GTAAAAT C GAGAG CAAC 1245 

Qy 541 T T G GAAGGT AAAGT G GAT AAGAAAC AC T T TT C AGAT AG CC T T GAACAAAC AAAT C GT GAA 600 

I II M I II I II M II I I II III M I I II I M II I I I I I I I I I I I I II III 
Db 1246 T T G GAAAGT AAAGT GGAT AAAAAAT GT T T T G CAGAT AG C C T T GAGCAAACT AAT C AC GAA 1305 

Qy 601 AAAGATAGT GAAAGC AGT AAT GAT GAC AC T T CAT T T C C CAGT AC AC C AGAAG C T GT AAGA 660 

I II I II I I I M II I I I I I I I I I M I I I I I II I I I I I II I II I I I II I III 



Db 1306 AAAGAT AGT GAGAGTAGT AAT GAT GAT AC T T CT T T C C CC AGT AC GC C AGAAG GT ATAAAG 1365 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC C AAC AAC T G AGAAT GT T 717 

I I M I I I I II II I I I I I I I I II I I I II I I I I I I || I I I I I I I I | II 
Db 1366 GAT C GT C C AG GAGC AT AT AT C AC AT GTGCTCCC T TT AAC C CAG C AG C AAC T G AGAGC AT T 1425 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1426 G CAAC AAACAT TT T TCCTTTGT TAG GAGAT C CT ACT T CAGAAAAT AAGAC C GAT GAAAAA 1485 

Qy 778 AAGATAGAA-AAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATCA 836 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I | I I I 
Db 1486 AAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCA 1545 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896 

I I I I I I I I Ml I I II I I I I I I I I III I I I I I I I I I I I I I I I I I I II I II I 
Db 1546 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

I I I I I M I I I I II I I I I I I I I II I I I I I I I I I I I I I I I II II I I I I I I I I || 
Db 1606 ACAAAGGT GACT GAG GAAGT C GT G GCAAACAT GC CT GAAG GC C T GAC T C C AGAT TT AGT A 1665 

Qy 957 CAGGAAGCAT GT GAAAGT GAATT GAAT GAAGCT ACT GGTACAAAAATT GC CTTT GAAACA 1016 

I I I M II I II I I I I I II I I I I II I I I I I I I I I 1 I I I I I I I I M I I I I I I I I I I I I I 
Db 1666 CAGGAAGCAT GT GAAAGT GAAT T GAAT GAAGTTACT GGTACAAAGATT GCTTAT GAAACA 1725 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

I I I I I I I I I I I I I I I I I M I I II I I I I I I I I II I I I I I I || I I | I | | | | | | 
Db 1726 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I I I I I I I I I I I I I II I I I I II I I I I I I I I I I 1 I I I I I | | | M | | | | | | | | | | | | 
Db 1786 CT T T GC C CAT CAT T T GAAGAGT C AGAAGCT ACT C CT T CAC C AGT T TT GCCT GACAT T GT T 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I I I I I I I I I I M I I I I I I I II I I I I I I I I I II I I I I I I I II I I I MM III 
Db 1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 T CAT CAC CAT T AGAAACT C TT C CT T C AGT T AAT TAT GAAAGC AT AAAGT T T GAG CCT GAA 1256 

I I I I I I I I M M I I I I I I I II II I I I II I II I I I II II I II I M I II I I II I 
Db 1906 T CAT CAC CAT T AGAAG C T T C T T C AGT T AAT TAT GAAAG C AT AAAAC AT GAG CCT GAA 1962 

Qy 1257 AAT C C C C CAC CAT AT GAG GAGG C CAT GAAT GTAT CACT AAAAAAAGAAT C AGGAAT GAAT 1316 

M I I I I I I I M I II II I II I I I II I I I II I II II I II I I II II II I I II II I II 
Db 1963 AAC C C C C CAC CAT AT GAAGAG GC CAT GAGT GTAT CACT AAAAAAAGT AT CAG GAAT AAAG 2022 

Qy 1317 GAAGAAAT C AC AGAG CCT GAAG GTAT T AGT GTAGC T GT T CAG GAAAC AGAAG CT CCT TAT 1376 

I I I I I I M I I I I I I I I I II I I I I I II II I I I II I I I II I I II I II I II II I I 
Db 2023 GAAGAAAT T AAAG AG CCT GAAAAT AT TAAT GC AGCT C T T CAAGAAACAGAAGC T C CT T AT 2082 

Qy 1377 AT AT C TAT T G CAT GT GAT T TAAT T AAAGAAACAAAGAT CT CT AC T GAAC C GAC T C CAGAT 1436 

I I M I I I I I M I I II II I I II I II II I II II I II I I I III I I I I I I I I I I I III 
Db 2083 AT AT CT ATT GCAT GT GAT T TAAT T AAAGAAAC AAAGC T T T C T GCT GAAC C AGCT C C GG AT 2142 

Qy 1437 T T CT C T AGT TAT T C AGAAAT AG C AGAAGT T GCACAG C CAGT GC C C GAG CAT T C T GAGC T A 1496 

I I I M I I II I I I I II I I I III I I II II II II II I II II I II I I I II I I I II II 
Db 2143 T T C T C T GAT TAT T C AGAAAT G G C AAAAGT T GAACAG C CAGT GCCT GAT CAT T C T GAGC T A 2202 



Qy 1497 GT T GAAGAT TCCTCCCCC GAT T C T GAAC C AGT T GACT TAT T TAGT GAT GAT T CAAT AC C C 1556 

I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 2203 GT T GAAGAT T C CT C AC C T GAT T CT GAAC CAGT T GAC T T ATT TAGT GAT GAT T CAAT AC C T 2262 

Qy 1557 GAAGT T C C ACAAAAACAAGAT GAAG CT GT AAT AC T T GT GAAAGAAAAC C T C AC T GAAAT T 1616 

II I I I II I I I I I I II I I I I I I I I I I I I II I I I I II I II I I I I I I I I I I I I I I 

Db 2263 GAC GT T C C ACAAAAACAAGAT GAAACT GT GAT G CT T GT GAAAGAAAGT CT C AC T GAGAC T 2322 

Qy 1617 T CAT CT GAGT CAAT GACAGGACAT GACAATAAGGGAAAACT CAGT GCT T CAC CAT CACCT 1676 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 2323 T CATTT GAGT CAAT GATAGAAT AT GAAAATAAGGAAAAACT CAGT GCTTT GC CACCT 2379 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

I I I I I I I I I I I II I I I I I I I I II I I I I III I I I I I I I I I I I I I I I I I I I I I I 
Db 2380 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2439 

Qy 1737 T T AGC AC CT GAT GAAGTT T CAG CAT T GAC C CAAAAG GAGAAAAT C C C T T T GC AGAT GGAG 1796 

I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2440 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 2499 

Qy 1797 GAGCT CAAT ACT GCAGTT TAT T CAAGT GAT GGCT TAT T CAT T G CT C AGGAAGCAAAC C T A 1856 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II II I II I I I I I I I I I II 
Db 2500 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2559 

Qy 1857 AGAGAAAGT GAAACAT TT T CAG AT T CAT CT C C GAT T GAGAT TAT AGAT GAGTT C C C GAC C 1916 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2560 AG AGAAAC T GAAAC GT T T T CAG AT T CAT C T C CAAT T GAAAT TAT AGAT GAGT T C C C T AC A 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II I I I I I I I II I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I II II I I I I 

Db 2620 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2679 

Qy 1977 GT AGC C CACAAAAGT GAAAT T GC T GAC AT C CAGGAT G G AGC T G GGT CAT T GGC T T GT GC A 2036 

III II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 

Db 2680 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2739 

Qy 2037 G GAT T G C C C CAT GAC CTTTCTTT C AAG AGT AT AC AAC C T AAAG AG GAAG T T CAT 2090 

I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I Mill II I I 

Db 2740 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I I II I I I I I I I I II I I I I I III I I I I I I I I I I I I I I I I I I I I I 
Db 2800 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GAT GTTTCTGCTTT GGAT GCT CAAGCAGAGAT AG G CAGC AT AGAAAAAC C CAAAGT T C T T 2210 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2920 GT GAAAGAAGCT GAGAAAAAAC TTCCTTCC GAT AC AGAAAAAGAG GAC AGAT CAC CAT CT 2979 

Qy 2271 G CT AT AT T T T CAGCAGAG CT GAGT AAAAC T T CAGT T GT T GAC C T C CT CT AC T G GAGAGAC 2330 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2980 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3039 



Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I I I I I I I I I I I I I I I I I I I I I I ! I I II I II I I I I I I I I I I I I I I I II I I II I I I I 
Db 3040 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

I I I I I I I II I I I I I Mill I II I I I I I I I I I I M I I I I I I I I I I I II I I I I I II I I I 
Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

I I I I I I I I I I I II I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 3160 T T TAGGAT AT AC AAGG GT GT GAT C C AAGCT AT C C AGAAAT C AGAT GAAG GC C AC C CAT T C 3219 

Qy 2511 AG GG CAT AT T T GGAAT C T GAAGT T GC T AT AT CT GAGGAGT T GGT T CAGAAGT AC AG CAAT 2570 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I II I I I I Ml 
Db 3220 AGGGCATAT CT GGAAT CT GAAGT T GCT AT AT CT GAGGAGT T GGT T CAGAAGT AC AGTAAT 3279 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

I I II I I I I I I I I I II I I II I I i I I I I I II I II I I I I II II II I II II M I II M II 

Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

II I I I I I I I I II I I I I I II I I I I M II I I I II I I I II I I I I M M I I M II II I II I I I I 

Db 3340 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

I II I I I II I I I I I I I M I I I II I II I II M I I I I I I I I I I I II I I II I I I I I I I I I I 

Db 3400 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3459 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

II I II I I I I I II I II I I II I I I I I I II I II I II I I I I I I I I I II I I I I II I I II I II I 

Db 3460 ATTT AT GAAC GGCAT CAGGCGCAGAT AGAT CATTAT CTAGGACTT GCAAATAAGAAT GTT 3519 

Qy 2811 AAAGAT GCT AT GGCT AAAAT C CAAGCAAAAAT CC CT G GAT T GAAG C GT AAAGCT GAAT GA 2870 

I M I I I II I II I I I I I M I I II I I I II I I I II I I I I II I I II I I II I I I I II I II I I I I 
Db 3520 AAAGAT G CT AT GGC TAAAAT C CAAGCAAAAAT CC CT G GAT T GAAGC GCAAAGCT GAAT G A 3579 
Query Match 56.9%; Score 2179.4; DB 9; Length 4684; 

Best Local Similarity 78.2%; Pred. No. 0; 

Matches 3008; Conservative 0; Mismatches 731; Indels 110; Gaps 28; 
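
The Best Local Similarity figures in the two hit summaries above appear to equal Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). This definition is inferred from the reported counts; the listing itself does not state it. A minimal Python sketch of that check follows. The helper names parse_summary and local_similarity are hypothetical and not part of the search program.

```python
def parse_summary(text):
    """Turn 'Name value;' items from a hit summary into a dict of floats."""
    fields = {}
    for item in text.split(";"):
        item = item.strip()
        if not item:
            continue
        # The value is the last whitespace-separated token of each item.
        name, _, value = item.rpartition(" ")
        fields[name] = float(value.rstrip("%"))
    return fields


def local_similarity(fields):
    """Matches as a percentage of aligned columns (assumed definition)."""
    columns = (fields["Matches"] + fields["Conservative"]
               + fields["Mismatches"] + fields["Indels"])
    return 100.0 * fields["Matches"] / columns


# Counts copied from the two summaries in this listing.
hit_a = parse_summary(
    "Best Local Similarity 87.5%; Pred. No. 0; "
    "Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7;")
hit_b = parse_summary(
    "Best Local Similarity 78.2%; Pred. No. 0; "
    "Matches 3008; Conservative 0; Mismatches 731; Indels 110; Gaps 28;")

for hit in (hit_a, hit_b):
    print(round(local_similarity(hit), 1), hit["Best Local Similarity"])
```

Under this assumed definition both recomputed values round to the reported percentages (87.5 and 78.2), which suggests the summary counts survived the OCR intact even where the alignment rows did not.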

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

IIIIIIILIII.il I I I I M I I I I I I I I I II I I I I I M I I I I III I I I I I 
Db 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

Qy 61 GT ACT GC C C ACT GAAG GAAC ACT T C C AGCAAC T T CAAAT GAAG CT T CTAAAG C ATT CT C A 120 

M I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I II I I I I I I II II 

Db 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 1047 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

I I I I I I I I I I I I I I I I II I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1048 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240 

I I I I I II M I I I I I I I I I I I II I I I I I I I II I I I II I I I I I I I 

Db 1108 T CAGAAAT GGGAT CAT C TT T TAAAG GCT C C C CAAAAGGAGAGT C AG C CAT AT T AGT AGAA 1167 

Qy 241 AAT C CT AGG GACGAAAT AGT T GT GAG GAGT AGAGAT AAAGAAGAGGACT T AGT T AGT CT T 300 

II I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I 

Db 1168 AAC ACT AAG GAAGAAGT AAT T GT GAGGAGT AAAGACA AAGAG GAT T T AGT T T GT AGT 1224 

Qy 301 AAC AT C C T T CAT ACT CAGC AG GAGT T AT CT AC AGT C CT T AC GAAAT CAGT T GAAGAAGAA 360 

I I I I I I I I I I I I I I I I I II! INN 

Db 122 5 G CAGC C CT T C AC AGT C C ACAAGAAT C AC CT GT GGGTAAAGAA 12 66 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

M I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I II N II I I N I I 

Db 1267 GACAGAGT T GT GT C T C C AGAAAAG ACAAT G GACAT T T T T AAT GAAAT G C AGAT GT CAGT A 1326 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 480 

I III II II I I I I II I II I II I I I I I II I I I I I II I I II I I I I I I N I I 

Db 1327 GT AG C AC CT GT GAGG GAAGAGT AT G C AGAC T T TAAGCCAT TT GAAC AAGC AT GGGAAGT G 1386 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 540 

II I I II II I I I II I I I I II I II N I I II I I I I I N 

Db 1387 AAAGAT ACT TAT GAG G GAAGT AGGGAT GT GC T G G CT GC T AGAGCT AAT 1434 



Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

I I I II II I II I I II I I III I II I I II I I I I I II II I I I I I I N 

Db 1435 GTGGAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGG 1494 



Qy 601 AAAGAT AGT GAAAG C AGT AAT GAT GAC ACT T CAT T T C C C AGT AC AC C AGAAGC T GT AAG A 660 

II I I I I I I I I I I I I I I I I II II I I I I II II I I I II I I II I II I I I I I 
Db 1495 AAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAG 1554 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 720 

I I I I I I I II I I I I I I I I II I I I II I I I I I I M I I I I I II 
Db 1555 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACA 1614 

Qy 721 ACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAAAAG 780 

I II I II III II I II I I II I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I 
Db 1615 GCAAAC ACT T TCCCTTTGT T AGAAGAT CAT ACT T C AGAAAAT AAAAC AGAT GAAAAAAAA 1674 

Qy 781 ATAGAA-AAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATCAAAC 839 

I II I I I M I I I I I I I I I I I I I I I I I 1 I I I I I I Ml I M M I I I I I 
Db 1675 AT AGAAGAAAG GAAGG C C C AAAT T AT AAC AG AGAAGA C TAG C C C C AAAAC GT C AAAT 1731 

Qy 840 CCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCA 899 

I I I I II I I I I I I I I I I I I I I III II I M I I I I I I I I I I I I I I I I I I I I 

Db 1732 CCTTTCCTT GT AGC AGT AC AGGAT T C T GAGGC AGAT TAT GT T ACAAC AGAT AC C T TAT C A 1791 

Qy 900 AAGGT GAC C GAGGAAGT AGT GGCAAACAT GC CT GAAGGT CT AACC C C AGAT T T G GT T C AG 959 

I I I I I I I I II I I II I I II I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II I 
Db 1792 AAGGT GACT GAGGCAGCAGT GT CAAACAT GC CT GAAGGT CT GACGC CAGAT TT AGTT CAG 1851 

Qy 960 GAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAA 1019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1852 GAAGCAT GT GAAAGT GAACT GAAT GAAGCCACAGGTACAAAGATT GCTT AT GAAACAAAA 1911 

Qy 1020 ATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTT 1079 

Mill I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I II I I 
Db 1912 GT GGACT T GGT CCAAACAT CAGAAGCTATACAAGAAT CACTTT AC C CCACAGCACAGCTT 1971 

Qy 1080 T GC C CAT CTT T T GAAGAAT CT GAAG CT AC T C C GT CAC C GGT TT T G C CT GAC AT T GT CAT G 1139 

I I I I M I I I I I I I III I I I I I I I I II II I I I I I I I I I I I M I I I I I I I I I III 
Db 1972 T GC C CAT CAT T T GAG GAAG CT GAAG CAACT C C GT CAC C AGT TT T GC CT GAT AT T GT T AT G 2031 

Qy 1140 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 1199 

I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I MM I 

Db 2032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2091 

Qy 1200 T CAC CAT T AGAAACT C T T C CT T C AGT T AAT TAT GAAAG CAT AAAGT T T GAG C C T GAAAAT 1259 

II III I III I I I I I I II II M I II I I I M II I I I I I I II I I I I I I I I I 

Db 2092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

Qy 1260 C C C C CAC CAT AT GAG GAG G C CAT GAAT G T AT C AC T AAAAAAAG AAT CAG GAAT GAAT G AA 1319 

I M II I I I M I I 11 I I I M I I II II I I I II I I I I I I I I I I I I II Ml 
Db 2152 C C C C CAC CAT AT GAAG AAG C CAT GAAT GT AG C ACT AAAAGCTTTGGGAACAAAGGAA 2208 

Qy 1320 GAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATA 1379 

I III I I I I II II I I I II Ml II M II II I I I I I II I II I II II II II I I I I" II 

Db 2209 GGAATAAAAGAGCCTGAAAGTTTT AAT GCAGCT GTT CAGGAAAC AGAAGCT CCTTATAT A 2268 

Qy 1380 T C TAT T G CAT GT GAT T T AAT TAAAGAAAC AAAGAT C T CT AC T GAAC C GAC T C CAGAT T T C 1439 

II Mill I I I II I II II II II I M I I M I I I MM I I I II M I I I M I I I I II 

Db 2269 T C CAT T GC GT GT GAT T T AAT TAAAGAAAC AAAG CT CT C CACT GAGC CAAGT C CAGAT T T C 2328 



Qy 1440 T C T AGT TAT T CAGAAAT AGC AGAAGT T G C AC AG C C AGT G C C C GAG C ATT CT GAGC T AGT T 1499 

I I I I I I I I I I I II I I I I I I I II I I II I I I I I I I I I II I I I I I M I I I 

Db 2329 T C T AAT TAT T CAGAAAT AG CAAAAT T C GAGAAGT CGGTGCCC GAAC ACG CT GAGC TAGT G 23 88 

Qy 1500 GAAGATT C CT C C CCC GAT T CT GAAC CAGT TGACT TAT T TAGT GAT GATT CAAT AC C CGAA 1559 

II I I I I I I I I II II I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I II M Ml 

Db 2389 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 2448 

Qy 1560 GT T C C ACAAAAACAAGAT GAAGCT GT AAT AC T T GT GAAAGAAAAC CT C AC T GAAAT TT C A 1619 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2449 GT C C CACAAAC ACAAGAGGAGGCT GT GAT GCT CAT GAAG GAG AGT C T C ACT GAAGT GT C - 2507 

Qy 1620 TCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCTGAG 1679 

I II II I I I I III I I I I I I I I I I I I I II IN 
Db 2508 T GAGACAGTAGCC CAGCACAAAGAGGAGAGACT TAGT GC CT CAC CTCAGGAG 2559 

Qy 1680 GGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTA 1739 

I I I I II II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 2560 CTAGGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACAT AGT ACAAAAGAT GC T 2616 

Qy 1740 GCACCTGAT GAAGTTT CAGCAT T GAC CCAAAAGGAGAAAAT CC CTT T GCAGATGGAGGAG 1799 

Ml II I I I I II II I I I I II I I I I I I I I I I I I I I I I I I I I I I Mill I M 
Db 2617 G CAT C T AAT GAC AT T C C AAC AT T GAC C AAAAAG GAGAAAAT T T C T T T G C AAAT G GAAG AG 2676 

Qy 1800 CTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTAAGA 1859 

I I I II I I I I I I II I II I I I I I I I I I I II I I I I I I I I I I II I I I I 
Db 2677 T T TAAT ACT G CAAT T TAT T CAAAT GAT GACT T ACT T T CT T C TAAGGAAGACAAAAT AAAA 27 36 

Qy 1860 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTT 1919 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M M I I I 
Db 2737 GAAAGT GAAAC AT T T T C AGAT T CAT CT C C GAT T GAGAT AAT AGAT GAAT T T C C CAC GT T T 2796 



Qy 1920 GTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTA 1979 

I 1 1 1 M 1 1 1 1 MM II 1 1 1 1 1 1 1 1 1 1 1 1 II III 1 II II 1 M II II 1 1 1 1 1 
Db 2797 GT CAGT GCTAA AGAT GATT CT C CT AAAT T AGC CAAGGAGT ACAC T GAT CT AGAAGT A 2853 

Qy 1980 GCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGA 2039 

II M 1 II M 1 1 1 II 1 II II 1 II II 1 1 1 1 1 1 II 1 1 II 1 II 1 III 
Db 2854 TCCGACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAA 2913 

Qy 2040 TTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGAT 2099 

1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
Db 2914 TTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAGTACATGTTTCAGAT 2973 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 2159 

II II 1 1 1 1 II II II 1 1 II II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 M 1 
Db 2974 GAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCT 3033 

Qy 2160 GCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTTGTGAAAGAA 2219 

1 1 1 1 II II 1 1 1 1 1 1 M II II 1 1 1 1 1 1 1 1 1 II II 1 1 1 III 1 II M 1 1 
Db 3034 GCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAAGAA 3093 

Qy 2220 GCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTT 2279 

1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 II II 1 1 1 1 II M 1 1 
Db 3094 GCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTG 3153 



Qy 2280 T CAG C AGAGCT GAGT AAAAC T T C AGT T GT T GAC C T C CT C TACT G GAGAGACAT T AAGAAG 2339 

I I | | | M I I I I I I I I I I I I I M I M I I I I I I I I I I I M I I I I I M II I I I I I I I I I I M I 
Db 3154 T C AGCAGAGCT GAGT AAAAC T T C AGT T GT T GAC C T C CT C TACT G GAGAGACAT T AAGAAG 3213 

Qy 2340 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 2399 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I 

Db 3214 ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 3273 

Qy 2400 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 2459 

II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3274 GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 3333 

Qy 2460 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2519 

I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I 

Db 3334 T ATAAGGGCGT GAT C CAG GCT AT CC AGAAAT CAGAT GAAGGCCACC CATT CAGGGCATAT 3393 

Qy 2520 T T GGAAT CT GAAGT T GC T AT AT CT GAGGAGT T GGTT CAGAAGT ACAG C AAT TCTGCTCTT 2579 

II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3394 T T AGAAT C T GAAGT T GCT AT AT CAGAGGAAT T GGTT C AGAAAT ACAGTAATT CTGCTCTT 3453 

Qy 2580 GGT CAT GT T AACT GC ACAAT AAAAGAACT C AGAC GC CT CT T CT T AGT T GAT GAT T T AGT T 2639 

I || I I I I I III I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I I I 
Db 3454 GGT CAT GT GAACAGC ACAAT AAAAGAACT GAGGC GGCT T T T CT T AGT T GAT GAT T T AGTT 3513 

Qy 2640 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2699 

I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3514 GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 3573 

Qy 2700 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 2759 

I I I I I I I I I I I I II I I I I I I I II I I I I I I II I I I I I I I I M I I I I I M I I M I I I 1 
Db 3574 GGT CT GACACT ACT GATTTTAGCTCTGATCTCACTCTTCAGT ATT CCTGTTATTTATGAA 3633 

Qy 2760 C GGCAT CAGGC GCAAAT AGAT CATT AT CT GGGACTT GCAAATAAGAAT GTTAAAGAT GCT 2819 

I I I I I I I M I III I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 3634 CGGCAT CAGGT GCAGATAGAT CATTAT CTAGGACTT GCAAACAAGAGT GTTAAGGAT GC C 3693 

Qy 2820 AT GGC T AAAAT C CAAGCAAAAAT C C CT GGAT T GAAGC GTAAAG CT GAAT GAGAAAGC C T G 2 87 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II IN I I I I I I 

Db 3694 AT GG C CAAAAT C CAAGCAAAAAT C C C T GGAT T GAAG C GCAAAGCAGAT T GAAAAAGC C C C 3753 

Qy 288 0 AAAG AGT TAACAAT AGAG GAGT T TAT C TT T AAAG GG GAT AT T CAT T T GAT T C CAT T G G GG 2 93 9 

II I I I M I I I I I I I I I I II I I I I I I I I I I I I II I IN 

Db 3754 AA AC AGAAGT T CAT CTT T AAAGG GGAC ACT C ACT T GAT T AC GGG 3797 

Qy 2 94 0 AG GGT C AGG GAAGAACAAAG CCT T GAC AT T G C AGT G C AGT T T C ACAGAT CTT TAT T T T T A 2 999 

II I I I I I I II I I I I I I I I M I I I I I I I I I I I I I 

Db 3798 GGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT--CAGCTCTTTATTTTTA 3855 

Qy 3000 GCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAG 3059 

III III III I II I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I M I I I I 

Db 3856 GCAGTGCACTGTTTGAGGAAAAATTACCTGTCTTGACTT-CCTGTGTTTATCATCTTAAG 3914 

Qy 3060 TAT T GT AAGC T GC T AT GT AT GGAT T T AAAT C GT AAT CAT AT TTGTTTTTCCT GT AT GAG G 3119 

I I II I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3915 TAT T GTAAGCT GCT GTGTAT GGAT CT- CATT GTAGTCACACTTGT CTT CCCCAAT GAGGC 3973 



Qy 3120 CACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCT 3179 



Db 3974 GCCTGGTGAATAAAGGA--CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCT 4031 

Qy 3180 T G CT GT AT T T G G GGAAT T GCAAAGAAAGT G GAGC T G AC AGAAAT AAC C C T T T T C AC A 3236 

I I I I I I I I I I I I I I I I I II I III I I I I I I I I I I I I II I I I I 
Db 4032 AGCTGTATGCAGAGAGTTGTAAAGAAGGCAAATCTGGGGGCAGGGAAAACCCTTTTCACA 4091 

Qy 3237 GT TT GT G C AC T GT GT AC G GT C T GT GT AGGT T GAT GC AGAT T T T C T GAAAT GAAAT GT T T A 32 9 6 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I II I I 

Db 4092 G--TGTACTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTTCTGAAATGAAATGTTTA 4149 

Qy 3297 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 3356 

I I I I I I I I I I I I I I I I II I I I I I II I II I I M I I I I I I I I I 

Db 4150 GATGAGAGCATACTACTAAAGCAGAGTGGAAAACTCTGTC--TTTATGGTGTGTTCTAGG 4207 

Qy 3357 T GT AT T GT GAAAT T TACT GT T GT AT T AATT G C CAAT ATAAGT AAAT AT AG AT TAT AT AT A 3416 

I I I I I II I I I I I I I I I I I I I III I I I I II I I I I I I I I I I I I I II I I 
Db 4208 T GTATT GT G- AATT TACT GTTATAT T G C C AAT AT AAGT AAAT AT AGA CCTAA 4258 

Qy 3417 TCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTT 3476 

I I I I I I II I I II I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I M I I I I 

Db 4259 TCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTT 4318 

Qy 3477 GAT ACT T CT GT CAT GGGT T T TAT GT GT GTAGT C C CAAAGCAC ATAAG CT AG G GAGAAAC G 3536 

II I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I 

Db 4319 GAC CT C T GAGT CAT T GGT TAT - GC AGT GTAGT C CAAGC AC ATAAAC TAG GAAGAGAAAT G 4377 

Qy 3537 TACT T CT AGG C GC ACT AC C - AT CT GT T T T CAACAC GAACC GAC G C C AT GCAAAC AGAACT 3595 

I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I 

Db 4378 TAT T T GT AGGAGT GCT AC CT AC CAC C T GT TT T CAAGAAAAT AT AGAACT C CAACAAAAAT 4437 

Qy 3596 CCTCi^ACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTC 3650 

II II I I I I I I I I II I I I I I I I I M II I I I I 
Db 4438 AT AGAAT GT C AT T T CAAAGAC TT ACT GT AT GT AT AGT TAAT T T T GT CAC AGACT CT GAAA 4497 

Qy 3651 T G GAC T GAAT CTAAT GCTT C CAAAAAT GT T T GCAAAT AT CAAAC AT T GT T AT GT A 3705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 4498 TTCTATGGACTGAATTTCATGCTTCC--AAATGTTTGCAGTTATCAAACATTGTTATGCA 4555 

Qy 37 06 AGAAAAT AT - AAAT GAC GAT T T AT ACAAT T GT G GT T T AAG CT GT AT T GAAC T AAAT CT GT 37 64 

I I I I I II II I I I I II MINI I I I I I I I I I I I II I III I I I I I I I I I I I 

Db 4556 AGAAATCATAAAATGAAGACTTATACCATTGTGGTTTAAGCCGTACTGAATT--ATCTGT 4613 

Qy 3765 G GAAT GC AT T GT GAAC T GT AAAAGC AAAGT AT CAAT AAAGC T T AT AGAC T T AAAAAAAAA 3824 

II II II I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4614 G GAAT G CAT T GT GAACT GT AAAAG C AAAGT AT CAAT AAAG CT TAT AGAT C T T AAAAAAAA 4673 

Qy 3825 AAAAAAAAA 3833 

I I I I I I I I I 
Db 4674 AAAAAAAAA 4682 

RESULT 11 
US-10-267-502-214 

; Sequence 214, Application US/10267502 
; Publication No. US20040071700A1 
; GENERAL INFORMATION: 
; APPLICANT: Kim, Jaeseob 
; APPLICANT: Galant, Ron 
; TITLE OF INVENTION: Obesity Linked Genes 
; FILE REFERENCE: LSD-07416 
; CURRENT APPLICATION NUMBER: US/10/267,502 
; CURRENT FILING DATE: 2003-01-27 
; NUMBER OF SEQ ID NOS: 439 
; SOFTWARE: PatentIn version 3.2 
; SEQ ID NO 214 
; LENGTH: 3492 
; TYPE: DNA 
; ORGANISM: Mus musculus 
US-10-267-502-214 

Query Match 48.7%; Score 1866; DB 12; Length 3492; 

Best Local Similarity 80.9%; Pred. No. 0; 

Matches 2325; Conservative 0; Mismatches 500; Indels 49; Gaps 11; 
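
The summary figures above can be cross-checked against one another. The sketch below is not taken from the search program; it rests on two assumptions that happen to reproduce the printed values: that Best Local Similarity is Matches divided by all aligned columns (Matches + Mismatches + Indels), and that Query Match is Score divided by the score of a perfect self-match, taken here to equal the query length. The query length of 3833 is inferred from the last Qy coordinate shown in the preceding alignment, and the function names are illustrative only.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent identity over all aligned columns (assumed definition)."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


def query_match(score: float, query_length: int) -> float:
    """Score as a percentage of a perfect self-match.

    Assumes a perfect self-match scores one point per query base.
    """
    return 100.0 * score / query_length


# Figures printed for RESULT 11; query length 3833 is an inferred value.
similarity = best_local_similarity(matches=2325, mismatches=500, indels=49)
match_pct = query_match(score=1866, query_length=3833)

print(f"Best Local Similarity {similarity:.1f}%")  # 80.9%
print(f"Query Match {match_pct:.1f}%")             # 48.7%
```

Under the same assumptions, the figures printed for RESULT 12 (1671 matches, 209 mismatches, 41 indels, score 1414.6) give 87.0% and 36.9%, which agrees with that record's summary.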

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I 

Db 664 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACACGGATACCTTGGTAACTTATCAGCA 723 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

II I I I I II I I II I I II I I I I I I I I I I I I I I I I I I II I I I I I I II 

Db 72 4 GT G GCAT C C AC AGAAGGAACT AT T GAAGAAAC T T TAAAT GAAGC TT CT AGAGAAT T G C C A 783 

Qy 121 GAGAAGGCAAAAAAT CCATTT GT AGAGAGAAATT TAACAGAATTTT CAGAATT GGAATAT 18 0 

I I I I I I I I I I I I I I I I I I I I I I I III I I I II I I I I I I I I I III I I I I I 
Db 784 GAGAGGGCAACAAAT C CATTT GTAAATAGAGAGT CAGCAGAGTTTT CAGT ATT AGAAT AC 843 

Qy 181 T C AGAAAT GGAAT CAT CAT T CAGT GGCT CT CAAAAGG C AGAAC CT G C C GTAAC AGT AGC G 24 0 

I I I I I I I I I I I I I II I I I I I I I I I II I t I I I I I I I I I I I I I I I I 

Db 844 T C AGAAAT G G GAT CAT CTT T CAAT GGCT C C C CAAAAG GAGAGT C AGC CAT GT T AGT AGAA 903 

Qy 241 AAT C CT AGG GAC GAAAT AGT T GT GAGGAGT AGAGAT AAAGAAGAG GACT T AGT T AGTCT T 300 

II I I I I I I I I I II I I I I I I I I I II I I II I I I I I I I I I I II I I I I I 

Db 904 AACACTAAGGAAGAAGTAATT GT GAGGAGTAAAGACA AAGAGGATTTAGTTTGTAGT 9 60 

Qy 301 AACAT C CT T CAT ACT CAGC AG GAGT T AT CT AC AGT C C T T AC GAAAT C AGT T GAAGAAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I 
Db 9 61 G C AGCC C T T CAT AAT C CACAAGAGT CAC CT GC GAC C C T T ACT AAAGT GGT T AAAGAAGAC 1020 

Qy 361 GAT AGAGT T CT GT C T C C AGAAAAAACAAAGGAC AGTT T TAAG GAAAAGG GAGT T G CAG C A 42 0 

I I I I I I I I I I I I I I II I M I I I I I I II I I I I I I I I I I I I II III 
Db 1021 G GAGTTAT GT CT C CAGAAAAGACAAT GGACAT TTTTAAT GAAAT GAAAAT GT CAGT G 1077 

Qy 421 GAAG CT T C TAT G GG GGAGGAAT AT GCAGACT T CAAAC CAT T T GAGC GAGT AT G G GAAGT G 4 80 

I III II II I I I I II I I I I I I I I II II II I I I I I I I II I I I I I I I I I I 
Db 1078 GT AG C AC CT GT GAGG GAAGAGT AT GCAGAT T T T AAGC C AT T T GAACAAGC AT GGGAAGT G 1137 

Qy 4 81 AAAGAT ACT T ACAAG CAAGAT AGT GAT GT T T T GAT T GC T GGAGGT AAT AT AGAGAG CAAA 54 0 

I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I 
Db 1138 AAAGAT ACT TAT GAG G GAAGT AGGGAT GT GCTGGCTGC T AGAG CTAAT A 1186 

Qy 541 T T G GAAG GT AAAGT G GAT AAGAAAC AC T T T T C AGAT AG C C T T GAACAAACAAAT C GT GAA 600 

Mill II I I I II I II II III I I I I I I I I I I I I I II I I I I II II II 

Db 1187 -TGGAAAGTAAAGTGGACAAAAAATGCTTTGAAGATAGCCTGGAGCAAAAAAGTCATGGG 1245 



Qy 601 
Db 1246 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 1 I I I I I I I III 
Db 1306 GACGGCTCCAGAGCGTACATCACCTGTGATTCCTTTACCTCAGCAACCGAGAGTACTGCA 1365 

Qy 721 ACAAAC AT TTTTCCCTTGTTG GAAGAT CAT AC T T C G GAAAATAAGAC AGAT GAAAAAAAG 780 

I I I II I I II I II II I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I 
Db 1366 G C AAAC AT TTTCCCTGT GCT AGAAGAT CAC AC T T CAGAAAAT AAAAC AGAT GAAAAAAAA 1425 

Qy 781 AT AGAA- AAAAAAAG G C ACAAAT T GT AACAGAGAAGAAT GCAAGT GT CAAGAC AT CAAAC 839 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I III I I I I I I I I I I 
Db 1426 AT AGAAGAAAGGAAGGC C CAAAT T AT AACAGAGAAGA CT AG C C C CAAAAC GT CAAAT 1482 

Qy 84 0 CCTT T C CTTAT GGCAGCACAGGAGT CTAAGACAGATTACGTTACAACAGAT CAT GT GT CA 899 

I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 83 CCTTTCCTT GT AG CAAT AC AT GAT T C C GAGGCAGAT TAT GT C ACAAC AGAT AAT TT AT C A 1542 

Qy 900 AAGGT GAC C GAGGAAGT AGT GGCAAAC AT G C C T GAAGGT CTAAC C CC AGAT T T GGT T C AG 959 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1543 AAGGT GACT GAGG CAGT AGT GGC AAC CAT G C CT GAAGGT CTAAC GCC AGAT T T AGT T C AG 1602 

Qy 960 GAAGCAT GT GAAAGT GAAT T GAAT GAAGCT ACT GGT ACAAAAAT T GC CT TT GAAACAAAA 1019 

I I M I I II I I I I I I I I II I I I I I I I I I II I I I II I I I I I I I I I I I I I I II I I I 
Db 1603 GAAGCAT GT GAAAGT GAACT GAACGAAGCCACAGGTACAAAGATT GCTT AT GAAACAAAA 1662 

Qy 1020 ATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTT 1079 

I I I I I II II II II I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1663 GT GGACTT GGT CCAGACAT CAGAAGCTATACAAGAGT CAATTT AC CC CACAGCACAGCT T 1722 

Qy 1080 TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 1139 

I M I I I II I I I I I III I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I III 
Db 1723 T G CC CAT CAT T T GAG GAAGCT GAAGCAAC T C C GT CAC CAGT T T T GC CT GAT AT T GT TAT G 1782 

Qy 1140 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 1199 

I I I I I II I II I I I I I I I I MM M II II I I I I I I I I II I I I I I II I I II 

Db 1783 GAAGCT C CATTAAAT TCTCTCCTTCCAAGCACT GGT GCTT CTGTAGCGCAGCC CAGT GC A 1842 

Qy 1200 T CAC CAT TAGAAAC T C TT C C TT C AGT T AATT AT GAAAG C AT AAAGT T T GAG C C T GAAAAT 1259 

II I I I I I II I I I I II I I I I I I I I II I I II I I I I I I I I I I II I II I I 

Db 1843 T C C C CACTAGAAGT AC C GT C T C C AGT T AGTT AT GAC GGT AT AAAG C T T GAGC C T GAAAAT 1902 

Qy 1260 C C C C CAC CAT AT GAG GAGGC CAT GAAT GT AT CACT AAAAAAAGAAT C AGGAAT GAAT GAA 1319 

II II II II I II I I I II I I I I I I I I I I I I I I I I II I I II I I II Ml 

Db 1903 C C C C CAC CAT AT GAAGAAGC CAT GAGT GT AGCACT AAAAA CAT C G GAC G C AAAG GAA 1959 

Qy 132 0 GAAAT C AC AGAGC CT GAAGGT AT T AGT GT AG C T GT T C AGGAAAC AGAAG C T C C T TAT AT A 137 9 

I I II I I I I I I I I I I I I II III II I I I I I I I I I I I I II I I II II I I I II I I I I 

Db 1960 GAAAT T AAAGAGC CT GAAAGTT T T AAT G C AG C T G CT C AGGAAG C AGAAG CT C C T TAT AT A 2 019 

Qy 1380 TCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTC 1439 

II I II I I I I I I I I I I II I I I II I I II I I I II I I I II I I I II II I I I I II III 
Db 2020 TCCATTGCATGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGGGTTC 2079 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 
II I I I I II II I I I I II I I I I I I I I I I II I I I I I I I I II I I I I III I 
Db 1246 AAGGATAGTGAAAGCAGAAATGAGAATGCTTCTTTCCCCAGTACCCCAGAACTTGTGAAG 1305 


Qy 144 0 T CT AGT TAT T C AGAAAT AG C AGAAGT T GC AC AGC C AGT GC C C GAG CAT T C T GAGCT AGT T 14 99 

I M I I I I I I I II I I I I I I I I II III II I II II M M I I I I I I I II 

Db 2080 T CTAAT TAT T C AGAAAT AGCAAAAT T T GAGAAGT C GGT AC CT GAT C ACT GT GAG CT C GT G 2139 

Qy 1500 GAAGAT TCCTCCCCC GAT T CT GAAC C AGT T GAC T T AT T T AGT GAT GAT T CAAT ACC C GAA 1559 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M Ml 

Db 214 0 GAT GAT T C CT C AC C C GAAT CT GAAC C AGT T GAC T TAT T T AGT GAT GAT T C GAT T CC T GAA 2199 

Qy 1560 GT T C CACAAAAACAAGAT GAAGC T GT AAT ACT T GT GAAAGAAAAC CT C AC T GAAAT T T C A 1619 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 2200 GT C C CACAAAC ACAAGAG GAGGC T GT GAT GC T AAT GAAGGAGAGT CT C ACT GAAGT G 2256 

Qy 1620 T CTGAGT CAATGACAGGACATGACAATAAGGGAAAACTCAGT GCTTCACCATCACCTGAG 167 9 

I I I I I I I I I I I I Ml I I M I I I I I I I I II I I I 1 I I I M I I I 
Db 2257 T C T GAGAC AGT AAC AC AAC AC AAAC AT AAGGAGAG AC T T AGT GCT T C AC CTCAGGAG 2313 

Qy 1680 G GAG GAAAAC C GT AT T T G G AGT C T T T T C AG C C C AGT T TAG G CAT C AC AAAAG AT AC CT T A 1739 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I II I I II I I I I I I 
Db 2314 GT AG GAAAGC CAT AT T T AGAGT C T T T T C AG C C CAAT T T AC AT AT T ACAAAAGAT GC T 237 0 

Qy 1740 GCAC CT GAT GAAGTT T CAGCATT GACCCAAAAGGAGAAAAT C CCT TT GCAGAT GGAGGAG 1799 

III II II II I II II I I I I I II I I I I I I I I I I III I I I I I I I I I I I I III 

Db 2371 GCAT CTAAT GAAATT CCAAC ATT GACCAAAAAGGAGACAATTT CTTT GCAAAT GGAAGAG 2430 

Qy 18 00 CT CAAT ACT G C AGT T TAT T CAAGT GAT GG CT T AT T CAT T GC T C AG GAAG CAAAC CT AAGA 1859 

I I II I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I II Ml 
Db 2431 T T T AAT AC T GCAAT T TAT T C CAAT GAT GACT T ACT T T C TT CT AAGGAAG ACAAAAT GAAA 24 90 

Qy I860 G AAAGT GAAAC AT T T T C AG AT T CAT C T C C GAT T GAGAT TAT AG AT G AGT T C C C GAC CTTT 1919 

I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M M I M 
Db 24 91 GAAAGT GAAACATTTT CC GATTCAT CT C CCATT GAGATAATAGAT GAGTT T C CCACATTT 2550 

Qy 192 0 GT C AGT T CTAAAG C AGAT T C TT CT C CT AC AT T AGC C AGGGAAT ACACT GAC CT AGAAGT A 197 9 

I I I I II I I I I II I I I I I I I I I Ml I I I I I I I I II I I I 

Db 2551 GT C AGT GCT AAAGAT GATT CT C CT AAGGAGTACACTGACCT AGAAGT A 25 9 8 

Qy 1980 GC CC ACAAAAGT GAAAT T GCT GAC AT C C AGGAT G GAGCT G GGT CAT TGGCTTGT GC AG GA 2039 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2599 TCCAACAAAAGT GAAATT GCTAATGTCCAGAGCGGGGCCAATTCGTTGCCTT GCT CAGAA 2 658 

Qy 2040 T T GC C C CAT GACC T T T CT T T CAAGAGT AT ACAAC C T AAAGAG GAAGT T CAT GT C C CAG AT 2099 

I I I I I I I II I I I II II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 659 TTGCCCTGT GACCT T T C T T T CAAGAAT AC AT AT C C T AAAGAT GAAG C AC AT GT C T CAG AT 2718 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 2159 

I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I M 

Db 2719 GAATTCTCCAAAAGTAGGTCCAGTGTATCTAAGGTGCCCTTATTGCTTCCAAATGTTTCT 27 7 8 

Qy 2160 GCTT T GGAT GCTCAAGCAGAGATAGGCAGCATAGAAAAAC C CAAAGTT CTT GT GAAAGAA 2219 

I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I III I I I I I I I 

Db 277 9 GCTT T G GAAT C T CAAAT AGAAAT G G G C AAC AT AGT T AAAC C C AAAGT AC T T AC GAAAGAA 2 838 

Qy 2220 GC C GAGAGAAAACT T C C T T CT G AT AC AGAAAAAGAG C GAAGAT C T C CAT CT GCT AT AT T T 2279 

II III M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I M 

Db 2839 GCAGAGGAAAAAC TTCCTTCT G AT AC AGAGAAAGAG GAC AGAT C C C T GAC AG CT GT AT T G 2898 



Qy 2280 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 2339 



Db 2899 T C AGC AGAG C T GAAT AAAAC T T C AGT T GT T GAC C T C CT GT AC T GGAGAGAC AT T AAGAAG 2958 

Qy 2340 ACTGGAGTGGTG---TTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGC 2396 

I I I I I I I I I I I I II I I I I I I I I I II I I I I II I I I I I I || I I I I I I I I I I I I I I 
Db 2959 ACTGGAGTGGTGTATTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 3018 

Qy 2397 ATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGG 2456 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 3019 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGG 3078 

Qy 2457 ATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCA 2516 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 3079 AT AT AT AAGGGT GT GAT C CAAG C TAT C C AGAAAT CAGAT GAAG GC C AC C CAT T C AG G GCA 3138 

Qy 2517 TAT T T GGAAT CT GAAGT T G CT AT AT C T GAG G AGT T GGTT CAGAAGT AC AG CAAT T CT G CT 2576 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I 
Db 3139 TATTT GGAAT CT GAAGTT GC CAT AT CAGAGGAATT GGTT CAGAAATAT AGTAATT CTGCT 3198 

Qy 2577 CT T GGT CAT GTTAAC T GCACAAT AAAAGAAC T CAGAC GC CT CT T CT T AGT T GAT GAT T T A 2636 

I I I I I I I I I I I III I I I I I II I I I I II I I II I I II I I I I I I I I I II I I I I I I I I 
Db 3199 CT T G GT C AT GT GAACAGCACAAT AAAAGAAT T GAG GC GT C T CT T CT T AGT T GAT GAT T T A 3258 

Qy 2637 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTC 2696 

I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 3259 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTATTTACTTACGTTGGTGCCTTGTTC 3318 

Qy 2697 AATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTAT 2756 

I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I III 
Db 3319 AAT G GT T T GACACT ACT GAT T T TAG C T CT GAT CT CACT CTT C AGT AT T C C T GT TAT AT AT 3378 

Qy 27 57 GAACGGCAT CAGGC GCAAATAGAT CATTAT CT GGGACTT GCAAATAAGAAT GTTAAAGAT 2 816 

I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 3379 GAAC GGCAT CAGGC GCAGATAGAT CATT ATCTAGGACTT GCAAACAAGAGC GTTAAGGAT 343 8 

Qy 2817 G CT AT GGC TAAAAT CCAAGCAAAAAT C C CT G GAT T GAAG CGTAAAGCT GAAT GA 2870 

II I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 3439 GC C AT GGC CAAAAT C CAAGCAAAAAT C C C T G GAT T GAAGC GCAAAG CAGAAT GA 3492 



RESULT 12 
US-10-220-891-22 

; Sequence 22, Application US/10220891 
; Publication No. US20030207286A1 
; GENERAL INFORMATION: 
; APPLICANT: NAKAGAWARA, AKIRA 
; TITLE OF INVENTION: NUCLEIC ACID SEQUENCES HAVING CHARACTERISTICS OF ENHANCED 
; TITLE OF INVENTION: EXPRESSION IN HUMAN NEUROBLASTOMA WITH FAVORABLE PROGNOSIS 
; TITLE OF INVENTION: BASED ON COMPARISON BETWEEN HUMAN NEUROBLASTOMA WITH FAVORABLE 
; TITLE OF INVENTION: PROGNOSIS AND HUMAN NEUROBLASTOMA WITH UNFAVORABLE PROGNOSIS 
; FILE REFERENCE: 7388-73435 
; CURRENT APPLICATION NUMBER: US/10/220,891 
; CURRENT FILING DATE: 2003-03-07 
; PRIOR APPLICATION NUMBER: JP 2000/140387 
; PRIOR FILING DATE: 2000-05-12 
; PRIOR APPLICATION NUMBER: JP 2000/159195 
; PRIOR FILING DATE: 2000-03-07 
; NUMBER OF SEQ ID NOS: 108 
; SOFTWARE: PatentIn version 3.2 
; SEQ ID NO 22 
; LENGTH: 1980 
; TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-220-891-22 

Query Match 36.9%; Score 1414.6; DB 16; Length 1980; 

Best Local Similarity 87.0%; Pred. No. 0; 

Matches 1671; Conservative 0; Mismatches 209; Indels 41; Gaps 9; 

Qy 1326 AC AGAG C CT GAAG GT AT T AGT GT AG CT GT T C AGGAAACAGAAGC T C C T TAT AT AT CT AT T 1385 

I I I I I I I I I I I I I I I I Mill! I I I I II I I I I I I I I I I I I I I I I I I I I I | | | | 
Db 28 AAAGAGC CT GAAAAT AT T AAT GC AGCT CT T CAAGAAAC AGAAG C T C C T TAT AT AT C TAT T 87 

Qy 1386 GCAT GT GATTTAATTAAAGAAACAAAGAT CT CTACT GAAC CGACT C CAGATTT CT CT AGT 1445 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I II I I I M I I I I I I 
Db 88 GCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTCTCTGAT 147 

Qy 144 6 TATT CAGAAATAGCAGAAGTT GC ACAGCCAGT GC CC GAGCATT CT GAGCT AGTT GAAGAT 1505 

I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I II 
Db 14 8 TAT T CAGAAAT G GCAAAAGT T GAAC AG C C AGT GC CT GAT CAT T CT GAGCT AGT T GAAGAT 2 07 

Qy 1506 TCCTCCCCC GAT T CT GAAC C AGT T GACT TATT T AGT GAT GAT T CAAT AC C C GAAGT TC C A 1565 

M I I I II I I ! I I I II I I I I I I I I I I I I I I I I I I I I || I I I I I I | | | | | || | | | | | I 
Db 208 T C CT C AC CT GAT T CT GAAC C AGT T GAC T T ATT T AGT GAT GAT T CAAT AC CT GAC GT T C C A 2 67 

Qy 1566 CAAAAACAAGAT GAAGCTGTAAT ACTT GT GAAAGAAAAC CT CACT GAAAT TT CAT CTGAG 1625 

I M I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I I 
Db 268 CAAAAAC AAG GT GAAAC T GT GAT GC T T GT GAAAGAAAGT C T CACT GAGACT T CAT T T GAG 327 

Qy 162 6 T CAAT GACAGGAC AT GACAAT AAGGGAAAACT C AGT GCT T C AC CAT C AC C T GAG GGAG GA 1685 

I I I M II II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 328 T CAAT GAT AGAAT AT GAAAAT AAGGAAAAACT CAGT G C T T T GC C AC CT GAG GGAGGA 38 4 

Qy 1686 AAAC CGT AT TT GGAGT C TT T T C AGC C CAGT T TAG GCAT C ACAAAAGAT AC CT T AG C AC C T 17 45 

I I I I I I I I I I I I I I I II I I I I 1 I I I I I II I I I I I I I I I I I I I I I MM 

Db 385 AAGC C AT AT TT G GAAT CT T T T AAG CT CAGT T T AGAT AACACAAAAGAT AC C C T GT T AC C T 44 4 

Qy 174 6 GAT GAAGT T T C AGCAT T GAC C CAAAAG GAGAAAAT C C CT T T G C AGAT G GAGGAGC T CAAT 18 05 

II I II I II II I I I II I II I I II I II I II I I I I I II II I I I II I I I I M I I I II I I 

Db 445 GAT GAAGT T T CAACAT T GAGCAAAAAG GAGAAAAT T C CT T T GC AGAT GGAG GAG CT CAGT 504 

Qy 1806 ACT GCAGT T TAT T CAAGT GAT G GC T TAT T CAT T G C T C AG GAAGCAAAC C T AAG AGAAAGT 18 65 



Db 




Qy 1866 GAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTTGTCAGT 1925 





Db 565 GAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATCAGT 624 



Qy 1926 TCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTAGCCCAC 1985 
M M I I Mill I I III II II I I I I I I I I I I I I II II I I I I II II I I Mill 



Db 625 CCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCCCAC 684 

Qy 1986 AAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGATTGCCC 2045 
I I I M II I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I 
Db 685 AAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCC 744 

Qy 2046 CATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGAT 2099 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I If I | II I 
Db 745 CATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCAGAT 804 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 2159 
M I I I I I I I I I I I III I I I I I I II II I I I I I I I I I I I I I I I I I I I I 
Db 805 GACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCT 864 

Qy 2160 GCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTTGTGAAAGAA 2219 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II M I I I I I I I II I I I II I I 
Db 865 GCTTTGGCCACTCAGGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAAGAA 924 

Qy 2220 GCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTT 2279 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 925 GCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATATTT 984 

Qy 2280 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 2339 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 985 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAG 1044 

Qy 2340 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 2399 
I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I 
Db 1045 ACTGGAGTGGTGTTTGGTGCCAGCCTATTCCAGCTGCTTTCATTGACAGTATTCAGCATT 1104 

Qy 2400 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 2459 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1105 GTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATA 1164 

Qy 2460 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2519 
M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I || | I I || 
Db 1165 TACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATAT 1224 

Qy 2520 TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 2579 
I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 1225 CTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTT 1284 

Qy 2580 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 2639 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I 
Db 1285 GGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTT 1344 

Qy 2640 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2699 
I I M I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | 
Db 1345 GATTCTCTGGAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAAT 1404 

Qy 2700 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 2759 
I I I I I M I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1405 GGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 1464 

Qy 2760 CGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCT 2819 
I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1465 CGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCT 1524 



Qy 


2820 


AT GGCTAAAAT C CAAGCAAAAAT C C CT GG AT T G A AGC GT A A AGC T r; A A t r n r a z\ z\ r r- rT r 

1 M 1 I II 1 1 1 1 1 1 1 1 1 1 I 1 | | | I | | | | | | M | | | | | | | | | | | M | | | | | | | | 'M 

ATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCA 


9P7Q 


Db 


1525 


ICQ/1 

1 0 0 4 


Qy 


2880 


AAAGAGTTAACAATAGAGGAGTTTATCTTTAAAf^GGGATATTrATTTr attpp n. r r r rrr t rr 

1 1 1 1 1 II | | | | | | | | | | | | | | | | | | | | || | | | | I N | | | | | | | | | | 

AAATAATTA GT AG GAGT T C AT CT T T AAAGGG GAT AT T CAT T T GAT TAT AC G GG GG 


OQOQ 


Db 


1585 


l bo y 


Qy 


2940 


AGGGTCAGGGAAGAACAA AGP CTT(^ AT ATT r^r A riTf^r Af^TTTTAr a/~7vt 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I || || || | | | | | || | | | | | | MM 
AG GGT C AGG GAAGAAC GAA- C C T T GAC GT T G C AGT G C AGT T T C AC AGAT C GT T GT T AGAT 


z y o o 


Db 


1640 


1698 


Qv 


2989 


CTTTATTTTTAGCAACGrAGTG— TPTH Af;PA AAA ATP ArrTPTrT 1 Tr'aPT' rrrrrn rrnrrprp 

1 1 1 1 1 1 1 1 1 M 1 1 1 III || | | | | | | || | | 1 1 II II 1 1 1 1 1 1 1 | | | II 1 1 M 1 1 

CT T TAT T T T TAG C CAT GC ACT GT T GT GAGGAAAAAT T AC C T GT C T T GAC T GC C AT GT GT T 


j04 / 


Db 


1699 


1 /DO 


Qy 

Db 


3048 
1759 


CATCATCTTAAGTATTGTAAGCTGCTATGTATr;r;ATTTAAATrnTAATr , aTZi r r f r r rr'T'T"p 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 II 1 III 

CATCATCTTAAGTATTGTAAGCTGCTATGTATGGGTTTAAACCGTAATCATATCTTTTTC 


olUo 

1 Q 1 O 

lo 18 


Ov 


3107 


-TTfCTfiT ATf^ APr^r* APTf^PTP, A AT AZ\z\paa 2\ra r TPTrarniiarr r T i r r rArpArrrTi7\r'7\r'mmrn 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

ui/\x LiA(jCjL.AU1 Cj(j1(jGA ATAAAAAACCTGTATATTTTACTTT 


3165 


Db 




1867 


Qy 


3166 


GT CG CAGGT AGT C T T G C T GT AT - T T G GGGAAT T GCAAAGAAAGT GGAGCT GAC AGAAAT A 

1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M Mill 

GT T G CAGAT AGT C T T GC C GCAT C T T G GCAAGT T GCAGAGAT GGT GGAGCT AGAAAAAAAA 


3224 


Db 


1868 


1927 


Qy 


3225 


A 3225 




Db 


1928 


1 

A 1928 





RESULT 13 
US-10-060-036-54 

; Sequence 54, Application US/10060036 
; Publication No. US20030073144A1 
; GENERAL INFORMATION: 
; APPLICANT: Benson, Darin R. 
; APPLICANT: Kalos, Michael D. 
; APPLICANT: Lodes, Michael J. 
; APPLICANT: Persing, David H. 
; APPLICANT: Hepler, William T. 
; APPLICANT: Jiang, Yuqiu 
; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
; TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
; FILE REFERENCE: 210121.566 
; CURRENT APPLICATION NUMBER: US/10/060,036 
; CURRENT FILING DATE: 2002-01-30 
; NUMBER OF SEQ ID NOS: 4560 
; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 54 
; LENGTH: 2235 
; TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-060-036-54 



Query Match 26.8%; Score 1025.4; DB 15; Length 2235; 

Best Local Similarity 86.7%; Pred. No. 5.2e-249; 

Matches 1361; Conservative 0; Mismatches 131; Indels 77; Gaps 18; 



Qy 


2304 


GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGT^TTTf^T^rrArr 

MM 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | | | | | | || | | | | | | , | | | 

GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 


z O DO 


Db 


697 


/DO 


Qy 


2364 


1 M M M 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 Mill 1 1 1 1 1 II 1 I I I I 

CTAT TCCTGCTGCTTT CAT T GAC AGT AT T CAG CAT T GT GAGCGT AAC AG C C T AC AT T GC C 




Db 


757 


o Id 


Qy 


2424 


TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAr^rcTrTr aTP^Arrr^ATr 

M 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 | | | | | || 1 MINI 

TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 


O A O O 


Db 


817 


Q 1 tZ 
O / D 


Qy 


2484 


CAGAAAT CT GAT GAAGGC CACC CATT C AGGGC AT ATT T cc A AT r t r a. a r tt r rT a. t> a t n m 

M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | I 1 1 1 1 1 1 1 1 1 1 1 

CAGAAAT C AGAT GAAGGC C AC C CAT T C AGGGC AT AT CT GGAAT CT GAAGT T GC T AT AT CT 


zo4 3 


Db 


877 


936 


Qy 


2544 


GAGGAGTTGGTTCAGAAGTACAGCAATTnTGrTPTTf^Tr athtta arTrrarflflrpss a n 

1 1 1 1 1 1 II 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 || | | || | | | | Mill 

GAGGAGT T GGT T CAGAAGT ACAGT AAT TC T GCTCTTGGT CAT GT GAAC T G C AC GAT AAAG 


zfoUo 


Db 


937 


996 


Qy 


2604 


GAACTCAGACGCCTCTTCTTAGTTG AT fJATTT A f^TT^: ATT r*TPTr Anr r r r r r rrr' nrrvrrnrnn 

M 1 M 1 M 1 1 II II 1 II 1 1 1 1 1 M 1 II 1 M 1 II 1 1 M 1 1 1 1 1 II 1 1 II II 1 1 1 II 1 1 M 

GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 




Db 


997 


lUOD 


Qy 


2664 


ATGTGGGTATTTACCTATGTTGGTGrrTTt^TTPAATr^r^TrTrArzirTZirTZi z\ r r r r | -PT , ("("r , 'T' 

1 1 1 M II 1 II 1 M 1 1 1 II 1 II II M 1 1 1 1 1 II 1 1 1 1 1 1 || || | | | | || | I I | || M II 

ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 


O T O O 


Db 


1057 


T 1 T C 


Qy 

Db 


2724 
1117 


CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAArr^r ATr Afzrrrr a a aTar-aTr'aT 

II M II II 1 1 II II II 1 1 1 1 II II II 1 1 1 1 1 II 1 II Ml 

C T CAT T T CAC T C T T C AGT GT T C C T GT T AT TT AT GAAC GGCAT CAG GCAC AGAT AGAT CAT 


OIQ'J 

Z / o J 
11 / b 


Qy 


2784 


T AT CT G GGAC T T GCAAAT AAGAAT GT T AAAGAT GC T A T nar T A A A A T r r a a r r a a a a a t r 

1 1 1 1 1 1 1 1 1 1 1 M II | || | | || | || | | | | | | || | | | M II 1 1 II 1 II 1 1 1 1 1 1 1 

TAT C T AGGAC T T G CAAAT AAGAAT GT T AAAGAT GCT AT G GCT AAAAT C CAAGCAAAAAT C 


O Q A T 


Db 


1177 


1 Z s5 D 


Qy 


2844 


C CT GGATT GAAGCGTAAAGCT GAAT GAGAAAGC CT GAAAG AGT T A A C A A T A a A r r A r;T TT 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 II | | | | || 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

C C T G GAT T GAAGC GCAAAG CT GAAT GAAAAC GC C CAAAAT AAT T A GTAGGAGTTC 


Z U J 


Db 


1237 


1 9 Q 1 
X Z _? X 


Qy 


2904 


AT C T T T AAAGG G GAT AT T CAT T T GAT T C CAT T G G GGAG G GT C AGG GAAGAACAAAG C C TT 

M 1 1 II 1 II 1 1 II 1 II 1 1 | | | | M 1 1 1 1 1 1 1 | 1 II M 1 II II II 

AT CT T T AAAG GGGAT AT T CAT T T GAT TAT AC GG G G GAG G GT CAGGGAAGAAC GAA- C C TT 


2963 


Db 


1292 


1350 


Qy 


2964 


GACATTGCAGTGCAGTTTCAC AGAT C T T T AT TT T TAG CAAC GC AGT G - T 

IN 1 1 1 1 1 1 Mill Mill Mill 1 III II 1 

GAC GT T GC AGT GC AGTT T CAC AGAT C GT T GT T AGAT C T T TAT TT T T AGC C AT GC ACT GT T 


3011 


Db 


1351 


1410 


Qy 


3012 


CTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 

INI II 1 II | | | | || || || II 1 II 1 1 I I I I || 1 II 1 1 1 1 1 M 1 II II 1 1 1 M 

GT GAG GAAAAAT T AC CT GT C T T GACT G C CAT GT GT T CAT C AT CT T AAGT AT T GT AAG C T G 


3071 


Db 


1411 


1470 



3072 



C T AT GT AT G GAT T TAAAT C GTAAT CAT AT TTGTTTTTCCT GT AT GAGG C AC T G GT GAAT A 3131 
N I II I II II II I I I I I II I II I I I II I I I I I I I I I I I I I I I I | | I I I I I I 
1471 CT AT GT AT GGAT TTAAAC C GTAAT CAT A — TCTTTTTCC TAT C T GAGGCAC T G GT GGAAT 1528 



3132 



AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTG 3190 

n „ Q [I MINIMI ! M i I I I MM MINI Ill 

152 9 AAAAAAC CTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTG 



1577 



3191 GGGAATTGCAAAGAAAGTGGAGCT GACAGAAAT AAC C C T T T T C AC AGT T T GT G C 324 4 

' Ml MINIM I I Ml II I || I I I I I I i I I I 

1578 G CAAGT T G C AGAGAT G GT G GAG C T AGAAAAAAAAAAAAAAAAG C C CT T T T C AGT T T GT GC 1637 



3245 ACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAA TGTTTAGACG 33 00 

'''Mill MM I I II II II II I II I I I I Ml II N I II I I I 

AC T GT GT AT G GT C C GT GT AGAT T GAT GC AGAT T T T CT GAAAT GAAAT GT T T GT T T AGAC G 



1638 



1697 



3301 AGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTA 3360 

111 ' MINI III INN | || | || || | || || || || 

1698 AGAT CAT AC C GGT AAAGC AGGAAT GACAAAGC T TG-CTTTTCT GGT AT GT T C TAG GT GT A 1756 

3361 TT GT GAAAT TT AC T GT T GTAT TAAT T GC CAAT ATAAGTAAAT AT AGAT TAT AT AT AT C T A 3420 

"MM I II II I || | || | | || || || || || || | || || | || || || | || 

1757 TT GT GACTTTTACT GTTATATTAAT T GCCAATATAAGTAAATATAGATT ATA TA 1810 

3421 TATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATA 348 0 

' " M I M I II II I I II I I || M I I I I INN I II II I N I II II I I I I 

1811 T GTAT AGT GT T T C ACAAAGCT T AGAC CT T T AC C TT - C CAG C C AC C CC AC AGT G C TT GAT A 18 69 

3481 CT T C T GT CAT GG GTTT T AT GT GT GTAGT C C CAAAG C AC ATAAG C T AGG GAGAAA 3534 

' I I N I II I I I I N II II I I || || || | || || | | || || || | || 

T T T C AGAGT CAGT CAT T G GT TAT AC AT GT GT AGT T C CAAAGC AC ATAAGC T AGAAGAAGA 



1870 
3535 



1929 



C GT ACT T CT AGGC GC AC T AC CAT C T GT T T T CAACAC GAAC C GAC G C CAT G CAAAC AGAAC 3594 

" M II II I II II | | | | | | | | | || I II I INN II I I II II 

1930 AATATT T CTAGGAGCACTACCAT CT GTTTTCAACAT GAA AT GCCACACACAT AGAAC 1986 

3595 T C C - TC AAC AT AAACT T C ACT GCAC AGACT T AC T GTAGT TAAT T T TAT C AC — AAACT C T 3651 

'" " II II I I I II I | || || || || | | || | | || || Mill 

1987 T C CAACAACAT CAAT T T CAT T G CAC AGACT GACT GTAGT TAAT T T T GT C AC AGAAT CT AT 2046 

3652 G GAC T GAAT CTAAT GCT T C CAAAAA T G T T T G C AAAT AT CAAAC AT T G T TAT GT 3704 

' 1 " I I II 1 I I || || | | || 11 I II I || || || || | | || I I II I I I II II 

2 047 GGACT GAAT CTAAT GCTTCCAAAAATGTT GTTT GTTT GCAAATATCAAACAT TGTT AT GC 2106 

37 05 AAGAAAATAT AAAT GAC GAT T TAT ACAAT T GT G GTT TAAG CT GTAT T GAACT 3756 

0 _ N I I II II I || | | | | || II II II I I II I I II II I II N II I I II I I II 

2107 AAGAAATT AT TAAT T AC AAAAT GAAGAT T TAT AC CAT T GT GGT T TAAGC T GT AC T GAACT 2166 

3757 AAAT C T GT G GAAT GC AT T GT GAAC T GTAAAAGC AAAGT AT CAATAAAGCT TAT AGACT T A 3816 

01 „ 111 I> I II MM Mill || || Mill Nil I I IN || || |M I I II II II I II 

2167 AAAT C T GT GGAAT GC AT T GT GAACT GTAAAAG CAAAGT AT CAAT AAAG C T TAT AGAC T T A 2226 

3817 AAAAAAAAA 3825 

N I II I II I 
2227 AAAAAAAAA 2235 



RESULT 14 

US-09-823-245A-349 

Sequence 349, Application US/09823245A 
Publication No. US20020039760A1
GENERAL INFORMATION: 
APPLICANT: Wong, Gordon G. 
APPLICANT: Clark, Hilary 
APPLICANT: Fechtel, Kim 
APPLICANT: Agostino, Michael J. 
APPLICANT: Howes, Steven H. 
APPLICANT: Resnick, Richard J. 
APPLICANT: Gulukota, Kamalakar 
APPLICANT: Graham, James R. 
APPLICANT: Genetics Institute, Inc. 

TITLE OF INVENTION: POLYNUCLEOTIDES ENCODING NOVEL SECRETED PROTEINS 
FILE REFERENCE: GIN 64 01 

CURRENT APPLICATION NUMBER: US/09/823,245A
CURRENT FILING DATE: 2001-03-29 
PRIOR APPLICATION NUMBER: 60/194,941 
PRIOR FILING DATE: 2000-04-06 
NUMBER OF SEQ ID NOS: 631
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 349 
LENGTH: 1514 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-823-245A-349 

Query Match 25.8%; Score 989.2; DB 13; Length 1514; 

Best Local Similarity 85.7%; Pred. No. 6.2e-240; 

Matches 1323; Conservative 0; Mismatches 143; Indels 78; Gaps 17; 
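
The Best Local Similarity figure above is consistent with the ratio of matches to aligned columns (matches + mismatches + indels): 1323 / (1323 + 143 + 78) = 1323 / 1544 = 85.7%. A minimal Python sketch of that check follows. The function name is hypothetical, and the formula is an assumption inferred from the reported counts, not taken from the search program's documentation.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are identical (assumed definition)."""
    # Every aligned column is either a match, a mismatch, or an indel.
    columns = matches + mismatches + indels
    return 100.0 * matches / columns


# Counts reported for this result: Matches 1323; Mismatches 143; Indels 78.
print(f"{best_local_similarity(1323, 143, 78):.1f}%")
```

Under the same assumption, any other result's summary line can be checked by passing in its own Matches, Mismatches, and Indels counts.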

Qy 2335 AGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCA 2394
        ||||||||||||||||||||||||||||| | ||||||||||| ||  ||||||||||||
Db    1 AGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCA 60

Qy 2395 GCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTA 2454
        |||||||||| ||||| |||||||||||||||||||||||||||||||| ||||||||||
Db   61 GCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTA 120

Qy 2455 GGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGG 2514
        ||||||| |||||||||||||| |||||||||||||| ||||||||||||||||||||||
Db  121 GGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGG 180

Qy 2515 CATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTG 2574
        ||||| |||||||||||||||||||||||||||||||||||||||||||||| |||||||
Db  181 CATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTG 240

Qy 2575 CTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATT 2634
        ||||||||||||| |||||||| ||||| |||||||| ||||||||||||||||||||||
Db  241 CTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATT 300

Qy 2635 TAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGT 2694
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 TAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGT 360

Qy 2695 TCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTT 2754
        | ||||||||||||||||| ||||||||||| ||||||||||||||||||||||||||||
Db  361 TTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTT 420

Qy 2755 ATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAG 2814
        |||||||||||||||| || |||||||||||||| |||||||||||||||||||||||||
Db  421 ATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAG 480

Qy 2815 ATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAA 2874
        ||||||||||||||||||||||||||||||||||||||||||| |||||||||||| ||
Db  481 ATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGAAAAC 540

Qy 2875 GCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCAT 2934
        |||  ||| | |||       ||||||| |||||||||||||||||||||||||||  |
Db  541 GCCCAAAATAATTA-----GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATTATAC 595

Qy 2935 TGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC---------- 2984
         |||||||||||||||||||| || ||||||| |||||||||||||||||
Db  596 GGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCGTTGT 654

Qy 2985 -AGATCTTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCT 3042
         ||||||||||||||||| | ||| || | ||||||||||| |||||||||||||||| |
Db  655 TAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCAT 714

Qy 3043 GTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTT 3102
        |||||||||||||||||||||||||||||||||||||||||||||| ||||||||||| |
Db  715 GTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATATCT 774



Qy 3103 GTTT--TTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTAC 3160
         |||  |  || | ||||||||||||| |  || |||            |||||||||
Db  775 TTTTCCTATCTATCTGAGGCACTGGTGGAATAAAAAAC-----------CTGTATATTTT 823

Qy 3161 ACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCTG---- 3215
        ||||||| |||| ||||||||| | || ||||  | ||||| |||  ||||||||
Db  824 ACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGAAA 883

Qy 3216 --ACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCA 3273
          | | ||| ||  |  ||   |||||||||||||||||| |||| ||||| ||||||||
Db  884 AAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCA 943

Qy 3274 GATTTTCTGAAATGAAA----TGTTTAGACGAGATCATGCCACCAAGGCAGGAGTGAAAA 3329
        |||||||||||||||||    ||||||||||||||||| ||   || |||||| ||| ||
Db  944 GATTTTCTGAAATGAAATGTTTGTTTAGACGAGATCATACCGGTAAAGCAGGAATGACAA 1003

Qy 3330 AGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCC 3389
        |||||| ||||| |||||||||||||||||||||||||  ||||||||| |||||||||||
Db 1004 AGCTTG-CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCC 1062

Qy 3390 AATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTCACGAAGCTTAGCCCTT 3449
        |||||||||||||||||||||||      ||| ||||||||||||| |||||||| ||||
Db 1063 AATATAAGTAAATATAGATTATA------TATGTATAGTGTTTCACAAAGCTTAGACCTT 1116

Qy 3450 TACCTTCCCAGCTGCCCCACAGTGCTTGATACT------TCTGTCATGGGTTTTATGTGT 3503
        |||||| |||||  ||||||||||||||||| |      || ||||| |||| ||  |||
Db 1117 TACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTATACATGT 1175

3504 GTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTAC CAT CT GTTT 3563 
Mill MMMMIIMM M I I I I I j I Mini 



Db 


1176 


Qy 


3564 


Db 


1236 


Qy 


3624 


Db 


1291 


Qy 


3677 


Db 


1351 


Qy 


3727 


Db 


1411 


Qy 


3787 


Db 


1471 



1176 GT AGT T C CAAAGC AC AT AAGC T AGAAGAAGAAAT AT T T C TAG GAG C AC T AC CAT CT GT T T 1235 
T CAAC AC GAAC C GAC G C CAT GC AAAC AGAACT C C T CAAC AT AAAC T T C AC T G C AC AGAC T 3623 

111111111 i i ii in i i iiiiii i imiimim 

T CAAC AT GAAAT G C C - - AC AC AC AT AGAAC T C CAAC AT C AAT T T CAT T G C AC AG AC T 1290 
T AC T GT AGT TAAT T T TAT C AC — AAAC T C T G GACT GAAT CTAAT G C T T C CAAAAA 367 6 

I I I I I I I II II I I II II I I II I I | | M | | | | | | | | | || || | | , 

GACT GT AGT TAAT T TT GT C AC AGAAT CT AT GGAC T GAAT C TAAT G CT T C CAAAAAT GT T G 1350 
T GT T T G CAAAT AT CAAAC AT T GT T AT GT AAGAAAAT AT AAAT GAC GAT T T 372 6 

M I II I II II II II I I I II I I I II I I I IIIIII IN MMii M ||| 

T T T GT T T GCAAAT AT CAAAC AT T GT TAT G CAAGAAAT TAT TAAT T ATAAAAT GAAGATT T 1410 
AT ACAAT T GT GGT T T AAGC T GT AT T GAAC TAAATCTGTG GAAT GC AT TGT GAAC TGT AAA 37 8 6 

<NI I I I I I I II I I I I I I II II I I II I I II I | | | | | | | | | || | || | | | | || || 

ATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAA 14 7 0 
AGCAAAGT AT CAATAAAGCT TAT AGAC T T AAAAAAAAAAAAAAA 383 0 

I I M I M I I I I II I I I I I I I I I II I I I MINIM I III 



RESULT 15 
US-10-466-258-10
; Sequence 10, Application US/10466258
; Publication No. US20040132096A1
; GENERAL INFORMATION:
; APPLICANT: GLAXO GROUP LIMITED
; TITLE OF INVENTION: ASSAY
; FILE REFERENCE: P80966 GCW
; CURRENT APPLICATION NUMBER: US/10/466,258
; CURRENT FILING DATE: 2003-07-15
; NUMBER OF SEQ ID NOS: 13
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 10
; LENGTH: 1798
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (215)..(814)
US-10-466-258-10



Query Match 25.7%; Score 986; DB 17; Length 1798; 

Best Local Similarity 86.1%; Pred. No. 4.5e-239; 

Matches 1360; Conservative 0; Mismatches 140; Indels 80; Gaps 21; 

Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363
        |||||||||||||| |||||||||||||||||||||||||||||||||||||||||||||
Db  248 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 307

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423
         | ||||||||||| ||  |||||||||||||||||||||| ||||| ||||||||||||
Db  308 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 367

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483
        |||||||||||||||||||| ||||||||||||||||| |||||||||||||| ||||||
Db  368 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 427

Qy 2484 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543
        |||||||| ||||||||||||||||||||||||||| |||||||||||||||||||||||
Db  428 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 487

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603
        ||||||||||||||||||||||| |||||||||||||||||||| |||||||| |||||
Db  488 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 547

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663
        |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db  548 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 607

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723
        |||||||||||||||||||||||||||||||| ||||||||||||||||| |||||||||
Db  608 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 667

Qy 2724 CTGATTTCACT-CTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCA 2782
        || |||||||| ||||||||||||||||||||| |||||||||||||| || ||||||||
Db  668 CTCATTTCACTCCTTCAGTGTTCCTGTTATTTA-GAACGGCATCAGGCACAGATAGATCA 726

Qy 2783 TTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 2842
        |||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  727 TTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 786

Qy 2843 CCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTT 2902
        ||||||||||||||| |||||||||||| || |||  ||| | |||       |||||||
Db  787 CCCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA-----GTAGGAGTT 841

Qy 2903 TATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCT 2962
         |||||||||||||||||||||||||||  |  |||||||||||||||||||| || |||
Db  842 CATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA-CCT 900

Qy 2963 TGACATTGCAGTGCAGTTTCAC-----------AGATCTTTATTTTTAGCAACGCAGTG- 3010
        |||| |||||||||||||||||           ||||||||||||||||| | ||| ||
Db  901 TGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGT 960

Qy 3011 TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCT 3070
        | ||||||||||| |||||||||||||||| |||||||||||||||||||||||||||||
Db  961 TGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCT 1020



Qy 3071 GCTATGTATGGATTTAAATCGTAATCATATTTGTTT--TTCCTGTATGAGGCACTGGTGA 3128
        |||||||||||||||||| ||||||||||| | |||  |  || | |||||||||||||
Db 1021 GCTATGTATGGATTTAAACCGTAATCATATCTTTTTCCTATCTATCTGAGGCACTGGTGG 1080

Qy 3129 ATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT- 3187
        |  || |||            |||||||||  ||||||| |||| ||||||||| | ||
Db 1081 AATAAAAAAC-----------CTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATC 1129

Qy 3188 TTGGGGAATTGCAAAGAAAGTGGAGCTG------ACAGAAATAACCCTTTTCACAGTTTG 3241
        ||||  | ||||| |||  ||||||||       | | ||| ||  |  ||  |||||||
Db 1130 TTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTG 1189



Qy 3242 TGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAA----TGTTTAG 3297
        ||||||||||| |||| |||||| |||||||||||||||||||||||||    |||||||
Db 1190 TGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTAG 1249

Qy 3298 ACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGT 3357
        |||||||||| ||   || |||||| ||| |||||||| |||| ||||||||||||||||
Db 1250 ACGAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG-CTTTTCTGGTATGTTCTAGGT 1308

Qy 3358 GTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATAT 3417
        |||||||||  ||||||||| ||||||||||||||||||||||||||||||||||
Db 1309 GTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGTAAATATAGATTATA----- 1363

Qy 3418 CTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTG 3477
         ||| ||||||||||||| |||||||| |||||||||| |||||  ||||||||||||||
Db 1364 -TATGTATAGTGTTTCACAAAGCTTAGACCTTTACCTT-CCAGCCACCCCACAGTGCTTG 1421

Qy 3478 ATACT------TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAG 3531
        ||| |      || ||||| |||| ||  |||||||| ||||||||||||||||||
Db 1422 ATATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAAGA 1481

Qy 3532 AAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAG 3591
        | |  || ||||||| ||||||||||| |||||||||| |||   | ||||  || | ||
Db 1482 AGAAATATTTCTAGGAGCACTACCATC-GTTTTCAACATGAA---ATGCCACACACATAG 1537

Qy 3592 AACTCC-TCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACA--AAC 3648
        ||||||  |||||| || |||| |||||||||| ||||||||||||||| |||||  | |
Db 1538 AACTCCAACAACATCAATTTCATTGCACAGACTGACTGTAGTTAATTTTGTCACAGGATC 1597

Qy 3649 TCTGGACTGAATCTAATGCTTCCAAAAA-------TGTTTGCAAATATCAAACATTGTTA 3701
        | ||||||||||||||||||||||||||       |||||||||||||||||||||||||
Db 1598 TATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTGCAAATATCAAACATTGTTA 1657

Qy 3702 TGTAAGAAAATAT--------AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGA 3753
        || |||||| |||        |||||| ||||||||| |||||||||||||||||| |||
Db 1658 TGCAAGAAATTATTAATTACAAAATGAAGATTTATACCATTGTGGTTTAAGCTGTACTGA 1717

Qy 3754 ACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGAC 3813
        ||||||||||||||| ||||||| ||||||||||||||||||||||||||||||||||||
Db 1718 ACTAAATCTGTGGAAGGCATTGTAAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGAC 1777

Qy 3814 TTAAAAAAAAAAAAAAAAAA 3833
          ||||  ||||||||||||
Db 1778   AAAACGAAAAAAAAAAAA 1797



Search completed: September 11, 2004, 16:03:53 
Job time : 1639.06 secs



